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Preface to the Second Edition 


The first edition of Complex Analysis focused on generalising concepts from real analy- 
sis to the complex case. Where there were differences, we looked at the geometric 
picture to see why they were happening. This second edition does the same, but it also 
focuses on the increasing sophistication of mathematical ideas as we build from intuition 
to rigour, in a manner where greater understanding leads to more sophisticated intuitions 
and ways of working. New concepts and methods often start out in a technical way, with 
problematic aspects that conflict with intuition. As well as generalising real analysis, we 
move beyond it by addressing these conceptual conflicts, resolving them, and providing 
more sophisticated concepts and methods appropriate to complex analytic functions. 

This approach is used throughout the book. So, for example, the text now includes a 
short (but complete) discussion of the construction of a space-filling curve, to challenge 
our intuition about continuity and to explain why we have had to be careful with topo- 
logical assertions that appear obvious. The treatment here is simpler than most of the 
literature on space-filling curves. We have spent some time examining different notions 
of a path, especially the role of smoothness. 

We have added three new chapters. Chapter 15 introduces ideas about infinitesimals 
in real and complex analysis, thought of as variables that tend to zero, and formulated 
as elements of extensions of the real and complex fields. Chapter 16 gives a formal link 
from analysis back to geometric intuition, formulating and proving homology versions 
of Cauchy’s Theorem and Cauchy’s Residue Theorem. Chapter 17 outlines a few of the 
more advanced directions in which complex analysis has developed, and continues to 
evolve into the future. 

Chapter 15 has been added for the following reasons. Since the first edition appeared 
in 1983, the ways in which we operate mathematically have changed dramatically. Not 
only are there computers that perform numerical and symbolic operations at a speed 
way beyond that previously available to the individual mind; there are also interactive 
graphics drawn on high-resolution screens that let us visualise mathematical ideas in 
completely new ways. In particular, we can dynamically magnify pictures to see tiny 
detail that lets us represent ‘arbitrarily small’ quantities. 

This second edition therefore includes an extra chapter to introduce formally defined 
infinitesimals that lie in an ordered extension field K of the real numbers, which can 
be manipulated algebraically and visualised formally on an extended number line. This 
approach generalises to the complex case using the field K(i) where i? = —1, which 
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can be visualised in the extended complex plane. This construction offers a meaningful 
bridge between the epsilon-delta rigour of pure mathematics and the intuitive use of 
infinitesimals in applications. 

It can easily be shown that any proper ordered field extension K of the reals must 
contain infinitesimal elements x: that is, elements that are not zero yet satisfy |x| < r 
for all positive real numbers r. Using the completeness of the real numbers, we prove a 
simple theorem that any finite element of K has the form k = c + h, where c is real and 
h is infinitesimal or zero. A transformation in the form m(x) = (x — c)/e, where € is a 
positive infinitesimal, then lets us magnify infinitesimal detail near c and see it with our 
unaided human eyes in a real picture. This technique extends to the complex case in the 
field K(i). 

We can now illustrate why complex analysis is so different from real analysis. A 
differentiable complex function defined on an open set is locally expressible as a power 
series, and we may take K to be the smallest ordered extension field generated by a 
single infinitesimal ¢. The elements are power series )°,.,, ae" in € with possibly a 
finite number of terms in 1/¢, and each non-zero element has an order of infinitesimality 
n related to the first non-zero coefficient a, (where the element may be infinite if n 
is negative). Meanwhile a differentiable real function may be differentiable once but 
not twice, and this requires a much more sophisticated extension field K such as that 
given by the logical theory of non-standard analysis. While Gottfried Leibniz imagined 
infinitesimals of different orders, non-standard analysis fails to have this property and 
requires a much more sophisticated construction. At the end of the chapter we compare 
and contrast the various theories within a single framework. 

Chapter 16 on homology complements Chapter 9 on homotopy versions of Cauchy’s 
Theorem, and logically it could have been placed immediately after that. We postpone 
it to the penultimate chapter because we do not wish to delay the more practical payoff 
from Cauchy’s Theorem — Taylor and Laurent series, residues, evaluation of integrals, 
summation of series, and so on. 

Homology can be thought of as a way of characterising ‘holes’ in a topological space, 
which here is the domain of a complex function f. Singularities, where f is not differ- 
entiable, create such holes, and homology helps to describe the topological effect of 
singularities; for example, in the homology version of Cauchy’s Residue Theorem. To 
avoid including big chunks of algebraic topology, our approach to homology is based on 
step paths in open subsets of the plane, one of the main simplifying tools in this book. 
The proof is ‘bare hands’ and exploits the simple geometry of step paths and the abelian 
group structure of homology. 

Chapter 17 has been included to make it clear that complex analysis is still a major 
area of mathematical research. Complete though the classical theory may seem to be, 
there are numerous generalisations and new questions. The main topics mentioned are 
the Riemann Hypothesis, modular functions, several complex variables, complex man- 
ifolds, and complex dynamics — leading to the fractal geometry of Julia sets and the 
famous Mandelbrot set. 

In this new edition of Complex Analysis we have corrected all known typographical 
errors, simplifed some proofs, and reorganised the material in mostly harmless ways to 
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improve readability. We have brought the text and layout into line with current practice, 
and redrawn all the figures. Proofs, definitions, and examples are terminated with the 
‘end of proof’ symbol 0. The same symbol indicates the absence of a proof when the 


result is clear or has already been proved. Contrary to the prevailing wisdom, we do 
not insert punctuation marks at the end of displayed formulas. (Your tutors may object 
to this. Tradition is on their side. If they do, they can set you an extra exercise: insert 
all missing punctuation.) But it is now the twenty-first century. No one puts full stops 
(US: periods) at the end of book titles, or chapter or section headings. So why do this in 
displayed formulas, where it may cause confusion because punctuation marks are also 
often part of the symbolism? We suggest that clean typography should override pedantic 
punctuation. 

Formulas in the main text are another matter; here the absence of punctuation can 
cause confusion. We have followed tradition here. 


Online Supplementary Material 


Supplementary material including a concordance showing in more detail the changes 
between the previous edition and this one, and links to GeoGebra, can be found on the 
Cambridge University Press website: www.cambridge.org/Stewart&Tall2ed. 


Preface to the First Edition 


Students faced with a course on ‘Complex Analysis’ often find it to be just that — 
complex. In the sense of ‘complicated’. 

It’s true, of course, that the proofs of some of the major theorems in the subject can 
demand a certain technical versatility. But in many ways, on a conceptual level, complex 
analysis is actually easier than real analysis; it just isn’t always taught that way. 

This book is intended for use at the level of second or third year undergraduates, 
and it is based on experience accumulated from teaching such courses over the past 
decade. To exhibit the inherent simplicity of complex analysis we have organised the 
material around two basic principles: (1) generalise from the real case, and (2) when 
that reveals new phenomena, use the rich geometry of the plane to understand them. 
Our aim throughout is to encourage geometric thinking, with the proviso that it must be 
adequately backed by analytic rigour. 

The opening chapter sets the work in its historical context, and the history is often 
alluded to later as partial motivation. However, we feel that cultural changes often affect 
the status of conceptual problems: what was once an important difficulty can become 
a triviality when viewed with hindsight. It is not always necessary to drag today’s stu- 
dents through yesterday’s hang-ups. We argue the point at greater length below: it is 
fundamental to our entire approach. 


0.1 


The Origins of Complex Analysis, and 
lts Challenge to Intuition 


In a lecture in 1886, Leopold Kronecker asserted that the integers are made by God and 
all the rest is the work of Man (Gray [7]). If so, complex numbers are certainly one 
of humanity’s most intriguing mathematical artefacts. For centuries they have been a 
wonder to mathematicians and philosophers alike. It took nearly 300 years from their 
first appearance in Girolamo Cardano’s Ars Magna (The Great Art) to the publication 
of a formal definition that satisfies modern standards of rigour. Building on such foun- 
dations, the initiated reader might be forgiven for thinking that complex analysis must 
be an incredibly complicated theory. Yet here we come to a historical puzzle. Although 
it took nearly three centuries to obtain a satisfactory treatment of complex numbers, it 
then took less than a tenth of that time to complete a major part of complex analysis, 
which is far more sophisticated and extensive. 

Obviously the numbers must come first, or there is nothing to do analysis with, but 
the timescale is surprising. A possible explanation is that setting up the foundations 
adequately involved deep problems of a philosophical nature: it took a long time to come 
to grips with them, but once the ‘breakthrough’ had occurred, the further development 
was easy by comparison. 

History suggests otherwise. 


The Origins of Complex Numbers 


Cardano’s celebrated Ars Magna of 1545 is one of the most important early alge- 
bra texts. Diophantus’s Arithmetica of about 250 discussed the solution of equations 
and introduced a rudimentary form of algebraic notation. Muhammad al-Khwarizmi’s 
Al-kitab al-mukhtasar fi hisab al-gabr wa’ |-muqabala (The Compendious Book on Cal- 
culation by Completion and Balancing) appeared around 820. Its translation into Latin 
as Liber Algebrae et Almucabola gave us the word ‘algebra’. Al-Khwarizmi’s discussion 
was verbal, with no symbols but occasional diagrams. 

Cardano introduced a systematic algebraic notation, very different from what we use 
today. He used this to present the newly discovered solutions of cubic and quartic equa- 
tions. His book contained the solution of cubics discovered by Scipione del Ferro around 
1500, and independently by Niccolo Fontana (nicknamed ‘Tartaglia’, the stammerer) 
around 1535. The high point of the text is the solution of quartic equations found by 
Cardano’s student Lodovico Ferrari. The tangled tale of alleged duplicity and public 
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controversy that accompanied these discoveries can be found in Stewart [19, 20] and 
other historical sources. 
Ars Magna also discussed the simultaneous equations 
x+y=10 
xy = 40 
and obtained a solution (in modern notation) of the form 


x=S54¢V-15 y=5-V-15 


Cardano gave no interpretation for the square root of a negative number, but he did 
observe that, on the assumption that the quantities obey the usual algebraic rules, we 
can check that they satisfy the equations. His attitude to the discovery was dismissive: 
‘So progresses arithmetic subtlety, the end of which. ..is as refined as it is useless.’ 

In the same book he observed that applying Tartaglia’s formula to the cubic equation 


x= 15x+4 (0.1) 


leads to the solution 


r= yo4 Jaa + f2- Va 


in contrast to the obvious answer x = 4. 

In both instances there was a conflict between the intuition about numbers that math- 
ematicians had built up over the years, and the formal behaviour of the symbolic 
manipulations that Cardano was carrying out. It took centuries for mathematicians to 
extend the number concept and develop a refined intuition in which Cardano’s obser- 
vations make sense. The first step happened not long after, however. Raphael Bombelli 
(1526-73) suggested a way to reconcile the two solutions of (0.1) by manipulating the 
‘impossible’ roots as if they are ordinary numbers. Since 


2+/-1? =2+/-121 
Cardano’s expression becomes 


x=(2+VJ-1+(2-V-1D =4 


and the ‘impossible’ root is just the familiar root in a complex disguise. Bombelli’s work 
was the first hint that complex numbers can prove useful in solving real mathematical 
problems. But the message took a long time to sink in. 

In La Géometrie (1637), René Descartes made the distinction between ‘real’ and 
‘imaginary’ numbers, interpreting the occurrence of imaginaries as a sign that the prob- 
lem concerned is insoluble, an opinion shared by Isaac Newton at a later date. However, 
this view sits uneasily with Bombelli’s realisation that a formula involving complex 
numbers sometimes leads to a real solution, suggesting that the issue is not that simple. 

John Wallis [25] represented a complex number geometrically in his Algebra of 1685. 
On a fixed line the real part of the number was measured off (in the direction given by 
its sign); then the imaginary part was measured off at right angles, Figure 1. But this 
idea was largely forgotten. 
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Figure 1 Wallis’s geometric representation of a complex number. 


In 1702 John Bernoulli was evaluating integrals of the form 


i dx 
are + bx+e 


by partial fractions. Using the philosophy that complex numbers can be manipulated 
like real ones, he wrote the integrand as 


1 A B 


axt+bx+ce x-a x-f 


(using modern notation) where a, are the roots of the quadratic denominator, and 
obtained the integral in the form 


A log(x — a) + Blog(x — B) 


His bold decision to use the same method when the quadratic had no real solutions led to 
logarithms of complex numbers. But what were they? Both Bernoulli and Leibniz used 
the method, and by 1712 they were engaged in controversy. Leibniz asserted that the 
logarithm of a negative number is complex, while Bernoulli insisted it is real. Bernoulli 
argued that, since 

d(—x) dx 


Xx X 


it follows by integration that log(—x) = log(x). Leibniz, on the other hand, insisted 
that the integration was correct only for positive x. Once again, formal calculations that 
seemed sensible were in conflict with intuition. 

Leonhard Euler resolved the controversy in favour of Leibniz in 1749, pointing out 
that integration requires an arbitrary constant 


log(—x) = log(x) +c 


a point that Bernoulli had ignored. By formally manipulating expressions involving 
complex numbers, Euler derived a host of theoretical relations, including the famous 
formula of 1748: 

e” = cos@ +isind (0.2) 


Putting 0 = z we find 
e7 = —1 (0.3) 


The Origins of Complex Analysis, and Its Challenge to Intuition 


a fantastic relation that blends the three mathematical symbols e,i, and z in one 
surprising equation. The formula (0.3) is widely referred to as Euler’s formula, although 
he never published it explicitly. He did publish (0.2), of which it is a simple corollary, 
and this is also known as Euler’s formula. However, a formula equivalent to (0.2) had 
been found earlier by Roger Cotes in 1714. 

Extending the theory of logarithms to the complex case by defining 


log z = wif and only if e” =z 


we obtain other intriguing results. Formal manipulation gives 


log ztmmi = lB (etiyn =z- ery 


For an even integer m = 2n this gives 


elogc+2nmi =z 
So log z + 2n71 is also a logarithm of z: the complex logarithm is many-valued. For an 
odd integer m = 2n + 1 we have 


elog z+(2n+1)ri _ —z 


whence 
log(—z) = logz+ (Qn+ 1)z1 


This resolves the Leibniz—Bernoulli controversy: if x is real and positive, then log(—x) 
must be complex. 

As mathematicians refined their intuition to encompass complex numbers, everything 
started to fit together and make sense. The theory of complex numbers grew ever more 
fascinating. What was lacking was an interpretation that explained precisely what these 
entities are — a formal counterpart to the newly extended intuitions. 

In 1797 Caspar Wessel published a paper in Danish describing the representation of a 
complex number as a point in the plane. It went almost totally unnoticed until a French 
translation was published a hundred years later. Meanwhile the idea was attributed to 
Jean-Robert Argand, who wrote it up independently in 1806. Since that time the geo- 
metric interpretation of complex numbers has commonly become known as the Argand 
diagram. 

Another pioneer of the theory of complex numbers was Carl Friedrich Gauss. In his 
doctoral dissertation of 1799 he addressed a problem that had concerned mathemati- 
cians since the early eighteenth century. Initially it had been widely believed that, just 
as the solutions of real quadratic equations could lead to new ‘complex’ numbers, so 
would solutions of equations with complex coefficients lead to even more kinds of new 
numbers. But Jean d’ Alembert (1717-83) conjectured that complex numbers alone suf- 
fice. Gauss confirmed this in the ‘fundamental theorem of algebra’ — every polynomial 
equation has a complex root. At first he proved it in the purely real form that any real 
polynomial factorises into linear and quadratic factors, avoiding explicit use of imag- 
inaries; later he treated the general case. By 1811 he viewed the complex numbers as 
points in the plane, saying so in a letter to Friedrich Bessel. In 1831 he published full 
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details of his representation of complex numbers, which had begun to acquire an air of 
respectability. 

In 1837, nearly three centuries after Cardano’s use of ‘imaginary numbers’, William 
Rowan Hamilton published the definition of complex numbers as ordered pairs of real 
numbers subject to certain explicit rules of manipulation. (In the same year Gauss wrote 
to Wolfgang Bolyai that he had developed the same idea in 1831.) At last this placed the 
complex numbers on a firm algebraic basis. 


The Origins of Complex Analysis 


Unlike the gradual emergence of the complex number concept, the development of 
complex analysis seems to have been the direct result of the mathematician’s urge 
to generalise. It was sought deliberately, by analogy with real analysis. However, the 
mathematicians of the period tended to assume that everything in real analysis must 
automatically be meaningful in the complex case, so the main question must be how 
‘the’ complex version behaves. That there might not be a complex version, or several 
alternatives, was seldom appreciated, as the controversy over log(—x) illustrates. 

As noted above, there are early traces of analytic operations on complex functions in 
the work of Bernoulli, Leibniz, Euler, and their contemporaries. 

In his 1811 letter to Bessel, Gauss shows that he knew the basic theorem on com- 
plex integration around which complex analysis was subsequently built. In real analysis, 
when we integrate a function f between limits a and J, to get 


b 
/ f(odx 


the limits fully specify the integral. But in the complex case, where a and b represent 
points in the plane, it is also necessary to specify a definite path from a to b, and to 
‘integrate along the path’. The question is: to what extent does the value of the integral 
depend on the chosen path? 

Gauss says: 
I affirm now that the integral f f(x)dx has only one value even if taken over different paths, 


provided f(x)... does not become infinite in the space enclosed by the two paths. This is a very 
beautiful theorem whose proof... I shall give on a convenient occasion. 


It seems the occasion never arose. The crucial step of publishing a proof of this result 
was taken in 1825 by the man who was to occupy centre stage during the first flowering 
of complex analysis: Augustin-Louis Cauchy. After him, this result is called ‘Cauchy’s 
Theorem’. In Cauchy’s hands the basic ideas of complex analysis rapidly emerged. For a 
complex function to be differentiable, it must have a very specialised nature: its real and 
imaginary parts must satisfy certain properties called the Cauchy—Riemann Equations. 
He showed that contour integrals of differentiable functions have the property noted 
privately by Gauss. Further, if an integral is computed along a path that winds round 
points where the function becomes infinite, Cauchy showed how to compute this integral 
using the ‘theory of residues’. The latter requires no more than the calculation of a 


0.3 


The Origins of Complex Analysis, and Its Challenge to Intuition 


constant, called the ‘residue’ of the function, at each exceptional point, and knowing 
how many times the paths winds around that point. The precise route of the path does 
not matter at all — only how it winds round these exceptional points. 

Power series turned out to be important in the theory, and other workers extended 
these ideas. Pierre-Alphonse Laurent introduced ‘Laurent series’ involving negative 
powers in 1843. In this formulation, near an exceptional point zo, a differentiable 
function is expressed as a sum of two series 


f() = [ao + a (z — 20) +++ + an(zZ — 20)" +-°°] 
t[by(z— zg) ++ + a_g(z— Zo) "+: ] 


The residue of f(z) at z = Zo is then just the coefficient b,. Using the theory of residues, 


the computation of complex integrals often proved to be far simpler than could ever have 
been dreamed. 

Cauchy’s definition of analytic ideas such as continuity, limits, derivatives, and so on, 
were not the same as those we use today. He based them on infinitesimal notions, which 
fell into disrepute in the late nineteenth century — though recent developments in ‘non- 
standard analysis’, and a new theory we present in Chapter 15, show that we may have 
been over-hasty in judging Cauchy’s ideas. Moreover, Cauchy’s concept of ‘infinitesi- 
mal’ was a variable quantity that approaches zero as closely as we please, not a fixed 
quantity. See Tall and Katz [24] for detailed discussion and educational implications. 

A rigorous treatment was devised by Karl Weierstrass (1815-97) using definitions 
which are still regarded as fundamental, the ‘epsilon-delta’ formulation. Weierstrass 
founded his whole approach on power series. However, the geometric viewpoint was 
sorely lacking in his work (at least as published). This deficiency was remedied by far- 
reaching ideas introduced by Bernhard Riemann (1826-66). In particular, the concept of 
a ‘Riemann surface’, which dates from 1851, treats many-valued functions by splitting 
the complex plane into multiple layers, on each of which the function is single-valued. 
The crucial feature is how the layers join up topologically. 

From the mid-nineteenth century onwards, the progress of complex analysis has 
been strong and steady, with many far-reaching developments. The fundamental ideas 
of Cauchy remain, now refined and clothed in more recent topological language. The 
abstruse invention of complex numbers, once described by our mathematical forebears 
as ‘impossible’ and ‘useless’, has become part of an aesthetically satisfying theory with 
eminently practical applications in aerodynamics, fluid mechanics, electronics, control 
theory, and many other areas. 

Since the first edition of this book, formal theory has also evolved so that Cauchy’s 
ideas of infinitesimals can be visualised as points on an extended number line, which we 
describe in our new Chapter 15. 


The Puzzle 


We return to our historical puzzle. Why was the development of complex numbers so 
laboured and hesitant, whereas that of complex analysis was explosive? We suggest 
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a possible answer (only personal opinion and thus open to dispute). It is somewhat 
different from the ‘foundations + breakthrough’ explanation offered earlier. 

Looking at the early history of complex numbers, the overall impression is of count- 
less generations of mathematicians beating out their brains against a brick wall in search 
of — what? A triviality. The definition of complex numbers as ordered pairs of points 
(x,y), or as points in the plane, was obtained over and over and over again. It is even 
implicit in Bombelli’s work; it is there for all to see in Wallis’s; it crops up again by way 
of Wessel, Argand, and Gauss. Morris Kline remarks on page 629 of [11]: 


That many men — Cotes, de Moivre, Euler, and Vandermonde — really thought of complex 
numbers as points in the plane follows from the fact that all, in attempting to solve x” — 1 = 0, 
thought of solutions ... as the vertices of a regular polygon. 


If the problem has such a simple solution, why was this not recognised sooner? 

Perhaps the early mathematicians were not so much seeking a construction for com- 
plex numbers as a meaning, in the philosophical sense: ‘what are complex numbers?’ 
However, the development of complex analysis showed that the complex number con- 
cept was so useful that no mathematician in his right mind could possibly ignore it. The 
unspoken question became ‘what can we do with complex numbers?’, and once that 
had been given a satisfactory answer, the original philosophical question evaporated. 
There was no jubilation at Hamilton’s incisive answer to the 300-year old foundational 
problem — it was ‘old hat’. Once mathematicians had woven the notion of complex 
numbers into a powerful coherent theory, the fears that they had concerning the exis- 
tence of complex numbers became unimportant, because mathematicians lost interest in 
that issue. 

There are other cases of this nature in the history of mathematics, but perhaps none is 
more clear-cut. As time passes, the cultural world-view changes. What one generation 
sees as a problem or a solution is not interpreted in the same way by a later generation. 
It is worth bearing this in mind when thinking about the historical development of math- 
ematics. To interpret history solely from the viewpoint of the current generation may 
easily lead to distortion and misinterpretation. 

What this explanation omits is any discussion of why mathematicians lost interest in 
the meaning of complex numbers. And that leads to a question that sheds a different 
light on the historical development, which we now discuss. 


Is Mathematics Discovered or Invented? 


Students trying to understand new concepts are in a similar position to the pioneers who 
first investigated them. At any stage in our education, we build not just on our current 
knowledge, but on a variety of beliefs and intuitions that are often vague, and may not be 
consciously recognised. As a trivial example, children familiar with counting numbers 
may find it hard to adapt their thinking to negative numbers, or rational numbers. When 
faced with questions like ‘what is 3 minus 7?’ or ‘what is 3 divided by 7’, intuition 
based solely on whole numbers leads to the answer ‘can’t be done’. That makes it hard 
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to understand —4 or 3/7. In fact, these is not really trivial examples, because the world’s 
top mathematicians, centuries ago, were just as confused by the question ‘what is the 
square root of minus one?’ Even their terminology — ‘imaginary’ — reveals how puzzled 
they were. Intuitively they considered numbers to be ‘real’ — not in the sense we now 
use to distinguish real from complex, but as direct representations of real measurements. 
The new objects behaved like numbers in many ways, but they seemed not to correspond 
directly to reality. 

In such circumstances, it can be tempting to discard existing intuition completely. 
But it is more sensible to adapt the intuition to fit the new circumstances. It is much 
easier to do arithmetic with negative numbers or fractions if you remember how to do 
it with whole numbers; it is much easier to do algebra with complex numbers if you 
bear in mind how to do it with real numbers. So the trick is to sort out which aspects 
of existing intuition remain valid, and which need to be refined into a broader kind of 
understanding. 

One way to approach this issue is to take seriously a question that is often asked but 
seldom answered satisfactorily: is mathematics discovered or invented? One answer is 
to dismiss the question, and agree that neither word is entirely appropriate; moreover, 
they are not mutually exclusive. Most discoveries have elements of invention, most 
inventions have elements of discovery. Galileo would not have discovered the moons 
of Jupiter without the invention of the telescope. The telescope could not have been 
invented without discovering that sand could be melted to make glass. 

But leaving such quibbles aside, we can make a rough distinction between discov- 
ery, which is finding something that is already there but has not hitherto been noticed, 
and invention, which is a creative act that brings into being something that has not pre- 
viously existed. There is a case to be made that in this sense, mathematicians invent 
new concepts but then discover their properties. For example, complex integration is all 
about ‘paths’ in the complex plane. Intuitively, a path is a line drawn by moving the 
hand so that the pencil remains in contact with the paper — no jumps. We might choose 
to formalise this notion as a continuous curve — the image of a continuous map from a 
real interval to the complex plane. We might be interested in how the pencil point moves 
along this curve, which requires the map itself, not just its image. Sometimes we might 
wish the path to be smooth — to have a well-defined tangent. 

As it happens, we need all of these notions. Intuitively, they are all based on the 
same mental image. Formally, they are all very different. They have different definitions, 
different meanings, and different properties. A smooth path always has a meaningful 
length, for instance; a continuous path may not. The definitions we settle on in this book 
fit conveniently into the standard ideas of analysis, but they are not built into the fabric 
of the universe. We chose them, and by so doing we invent concepts such as ‘path’, 
‘curve’, and ‘smooth’. 

On the other hand, once a concept has been invented, we cannot invent its properties. 
When we also invent the concept ‘length’, we discover that every smooth path has finite 
length. We cannot ‘invent’ a theorem that the length of a smooth path can be infinite. If 
we weaken ‘smooth’ to ‘continuous’, however, we can discover that infinite lengths are 
possible; indeed, ‘length’ need not have a sensible meaning at all. In short: invention 
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opens up new mathematical territory, but exploring it leads to discoveries. We may not 
know what things are present in the territory, but we do not get to choose them. 

Sometimes — in fact, very often — we discover that our inventions have features that 
we neither expected nor intended them to have. We discover, perhaps to our dismay, that 
the image of a smooth path can have a right-angled corner, see Section 6.7. We did not 
expect that: a corner does not feel ‘smooth’. But its possibility is a direct consequence 
of the definition we invented. 

When this kind of thing happens, we have two choices. Accept the surprises as the 
price for having a nice, tidy definition; or rule them out by changing the definition — 
inventing a more comfortable alternative. In practice we often do both, by giving the 
alternative a different name. Here we could (and do) define a ‘regular path’ to be a 
smooth path y : [a,b] — C for which y’(t) 4 0 whenever t € [a, b]. Now the image 
cannot have a sharp corner. On the other hand, every theorem about regular paths must 
now take account of the consequences of that extra condition. We also have to remember 
that some theorems may be valid for regular paths but not for smooth paths, and so on. 

As we move from intuitive ideas to formal ones, we also refine our intuition so that 
it matches the formal theory better. Formal calculations start to make sense, not just 
as strings of symbols that follow from previous strings, but as meaningful statements 
that agree with our new intuitions. From this point of view, the history of complex 
analysis is the story of intuition co-evolving with an increasingly formal approach. This 
suggests that mathematicians lost interest in the meaning of complex numbers when 
they incorporated them into their intuitive assumptions and beliefs. With the apparent 
conflicts resolved by these refined intuitions, they were free to push the subject forward, 
no longer worried that it did not make logical sense. 

When a mathematical area ‘settles down’ into a mature theory, there is a broad con- 
sensus that certain concepts provide the most convenient route through the material. 
These concepts then become standard — things like ‘continuous’, ‘connected’, and so 
on. They get taught in lecture courses and printed in books. We may start to feel that 
the standard definitions are the only reasonable ones. Even so, we are always free to 
work with different concepts if that seems sensible, or even to modify definitions while 
retaining the same name — though that can be dangerous. Today’s concept of continuity 
is quite different from what it was in the time of Euler, but we use the same word; we 
just bear in mind that it now has a specific technical meaning. A historian reading Euler 
would need to be on their guard. 

It is also worth remarking that many mathematical concepts seem more natural to us 
than others. Counting numbers are very natural (we even call them the ‘natural num- 
bers’). The number i was baffling for centuries (and was called ‘imaginary’ as a result). 
Our culture, our society, and even our senses, predispose us towards certain concepts. 
Euclid’s points and lines correspond to early stages of the processing of images sent 
from the retina to the visual cortex. Newton’s concept of acceleration being related to an 
applied force reflects the way our ears sense accelerations and make us ‘feel’ a push — 
a force. 

It then becomes easy to imagine that mathematics somehow already exists in a realm 
outside the natural world. Even if humans invented numbers, in retrospect they seem 
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such a natural idea that surely they were just hanging around waiting to be invented. 
If so, that is more like discovery. This view is often called Platonism: the idea that 
mathematical concepts already exist in some ideal form in some kind of world outside 
the physical universe, and mathematicians merely discover how these ideal forms work. 
The contrary view is that mathematics is a shared human construct, but that construct is 
by no means arbitrary, because every new invention is made in the context of existing 
knowledge, and every new discovery must be logically valid. 

A major theme of this book is that many apparently puzzling aspects of complex 
analysis can be made more intuitive by paying attention to the geometry of the complex 
plane (in a broad sense, including its topology). This brings one of the human brain’s 
most powerful abilities, visual intuition, into play. For this reason, we draw a lot of pic- 
tures. However, a picture, and our visual intuition, can be misleading unless we examine 
the unstated assumptions that they involve. By doing so, we can refine out intuition and 
make it more reliable. For this reason, we do not just introduce important definitions and 
then deduce theorems that refer to them. We try to relate those definitions to intuition, 
to make the proofs easier to understand. Then we exhibit some of the positive results 
that arise, to convince you that the new concept is worth considering. And then... we 
show you that sometimes the formally defined concept does not behave the way intuition 
might suggest. Sometimes it turns out to be useful to strengthen the definition so that 
it matches intuition more closely. Sometimes we refine our intuition so that it matches 
the formal definition. Sometimes we can even do both, in which case we have to make 
some careful but useful distinctions. 

The historical events sketched earlier in this chapter offer many examples of this 
process. The square root of minus one went from being a puzzling idea that seemed to 
have no meaning to one of the most important concepts in the whole of mathematics. 
Along the way, mathematicians’ intuition for ‘number’ underwent a revolution. We can 
now to some extent short-circuit the historical debates — what were hang-ups then need 
not be hang-ups now — but when a new idea puzzles us, and doesn’t seem to make sense 
until we finally sort it out, it is helpful to remember that the mathematical pioneers often 
experienced exactly the same feelings, for much the same reasons. 


Overview of the Book 


It is often useful to set the development of a mathematical theory in its historical con- 
text, but it is not always necessary to fight the historical battles again. In this text we give 
honour where we can to those pioneers who carved their way through uncharted math- 
ematical territory. But more recent developments let us see the theory itself in a new 
light. To the modern ear the very name ‘complex analysis’ carries misleading overtones: 
it suggests complexity in the sense of complication. The older meaning, ‘composite’, 
was perhaps appropriate when the ‘real part’ of a complex number had a quite different 
status from that of the ‘imaginary part’. But nowadays a complex number is a perfectly 
integrated whole. To think of complex analysis as if it were, so to speak, two copies of 
real analysis, is to place undue emphasis on the algebra at the expense of the geometry, 
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which in the long run has been far more influential. And in fact complex numbers are not 
more complicated than reals: in some ways, they are simpler. For instance, polynomials 
always have roots. Likewise, complex analysis is often simpler than real analysis: for 
example, every differentiable function is differentiable as often as we please, and has a 
power series expansion. 

In preparing our approach to the subject we have adopted two basic organising 
principles. The first is the direct generalisation of real analysis to the complex case. 
Definitions, of limits, continuity, differentiation, and integration are natural extensions 
of the corresponding real notions. Since nowadays any student taking a course in com- 
plex analysis may be assumed to have made a study of the real counterpart, many battles 
have already been won. We can refer students to their accumulated knowledge, paus- 
ing only to phrase it appropriately. This saves time and energy, allowing us to proceed 
straight to the heart of the subject, where the interesting differences occur. Invariably 
this happens because the plane has a richer geometry than the line, and this leads to our 
second major organising principle: geometric insight is valuable and should be culti- 
vated. Of course this insight must be translated into sound formal arguments; this can 
often be done using modern topological notions. 

From these two principles, a straightforward approach to complex analysis emerges. 
First, complex numbers are defined formally as ordered pairs of real numbers, giv- 
ing them a geometric interpretation as points in the plane. The topology of complex 
numbers is then a natural consequence of plane topology. In quick succession it is 
possible to derive complex generalisations of the notions of continuity, limits, and 
differentiation, with particular emphasis on power series, which play a central role 
later. A study of the complex exponential function, defined by the usual power series, 
reveals the intimate connection between this function and the trigonometric functions 
(also considered as power series). After generalising the notion of integration, the 
logarithm can be viewed either as the inverse function of the exponential, or as the 
integral 

dz 
logz= {| — 
z 
suitably interpreted. Either approach has to deal with the multivalued nature of the com- 
plex logarithm. This arises because the complex exponential has period 2771, so cannot 
be one-one. Resolving these issues involves close links between geometric intuition and 
formal analysis. 

At this stage Cauchy’s Theorem is presented in various guises, and the use of inte- 
gration leads to a proof that every differentiable function can be expressed as a power 
series. More generally, Laurent series (using positive and negative powers) take care 
of isolated points where functions become infinite, and lead to the powerful ‘theory 
of residues’ for calculating complex integrals, summing series, and counting zeros of 
equations in a given region of the complex plane. 

Returning to geometric ideas, complex analysis has many practical applications. 
Today it is widely used by physicists and engineers, in many different contexts. In partic- 
ular, it has proved invaluable in two-dimensional potential theory. The geometric ideas 
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of Riemann can be viewed in terms of modern topology, to give a global insight into 
‘many-valued’ functions (such as the logarithm) and open up new areas of progress. 

In this second edition of the book, we continue by presenting a formal set-theoretic 
approach to infinitesimals that has evolved since the first edition was published 35 years 
ago. It offers a new vision of complex analysis that includes both the analytic epsilon- 
delta approach of Riemann and the infinitesimal ideas of Cauchy in a broader overall 
theory. 

Next, we revisit Cauchy’s Theorem in the context of homology theory. Homology is 
a topological property of the domain of the function, and it detected the presence of 
holes. These holes are obstacles that cause integrals of complex functions to depend on 
the chosen path. Using step paths, we reformulate complex integration over ‘cycles’ in a 
domain. These are formal integer combinations of closed loops, so they form an abelian 
group. The subgroup of ‘boundaries’ has the property that the integral of any continuous 
function over a boundary is zero. So the difference between cycles and boundaries con- 
trols how integrals depend on the choice of path. The corresponding algebraic object is 
the quotient group of the group of cycles modulo the subgroup of boundaries, and this is 
the (first) homology group of the domain. It provides a formal algebraic interpretation of 
how integrals depend on the choice of path. This chapter provides a gentle introduction 
to homology in its simplest (old-fashioned) form, though even this approach requires 
some mathematical sophistication. The topological ideas shed light on the general area 
surrounding Cauchy’s Theorem. 

Finally, to show that complex analysis is still alive and kicking in the modern era, 
Chapter 17 provides a simplified overview of a few more recent developments. These 
include the still-unsolved Riemann Hypothesis, modular functions, generalising com- 
plex analysis to several variables (where strange new phenomena occur), to complex 
manifolds (multidimensional ‘surfaces’ with a complex structure, generalising Riemann 
surfaces), and the iteration of complex maps, or complex dynamics, which leads to 
remarkable fractal structures such as Julia sets and the Mandelbrot set. 


1.1 


Algebra of the Complex Plane 


“The Divine Spirit found a sublime outlet in that wonder of analysis, that portent of the 
ideal world, that amphibian between being and not-being, which we call the imaginary 
root of negative unity.’ So said Leibniz in 1702 — though he may have let his eloquence 
run away with him. The current view of ,/—1 is a little more prosaic, though the uses 
made of it are at least as inspiring. The logical status of complex numbers, which caused 
so much distress during the eighteenth century, is now seen to be very much on a par 
with that of the ‘real’ numbers. What puzzled the ancients was the obvious artificiality 
and abstraction of the complex number system, in contrast to the apparently natural and 
concrete real number system. But the mathematician of today sees even real numbers as 
possessing a similar artificiality and abstraction. 

In this chapter we discuss the construction of a system of numbers that contains the 
familiar real numbers and permits the solution of the equation x7 = —1. This system 
is known as the complex numbers. Many readers will already know the contents of this 
chapter: they should read it through rapidly to check such items as notation, and pass on 
at once to the next. 

There is a natural geometric representation of complex numbers as a plane, analogous 
to that of the reals as a line. The extra freedom inherent in the plane gives the whole 
subject a very geometric flavour, which it is our intention to keep to the fore in the 
development of the theory. 


Construction of the Complex Numbers 
We begin with the definition that emerged from the insights of Wallis, Wessel, Argand, 
Gauss, and Hamilton: 


DEFINITION 1.1. A complex number is an ordered pair (x, y) of real numbers. Addition 
and multiplication of complex numbers are defined by: 


(x1,¥1) + (x2, v2) = (41 + X2,91 + y2) (1.1) 
(x1, v1 (x2, y2) = (1x2 — Yi y2, X1y2 + x2y1) (1.2) 


For example, 


(3,5)2,7) =@G3-2-—5-7,3-7+5-2) = (—29,31) 


Algebra of the Complex Plane 


This definition is the culmination of several centuries of struggle to understand complex 
numbers, and it shows how elusive a simple idea can be. Before we see what these pairs 


have to do with ./—1, however, let us establish some of their properties. 


THEOREM 1.2. The set of complex numbers, with the operations defined by (1.1, 1.2), 
is a field. That is, the following axioms hold: if z1 = (x1, y1), 2 = (%2, y2), and 3 = 


(x3, y3) are complex numbers, then 
(a) Addition and multiplication are commutative: 
Z1+22=22 721 
£122 = 2221 
(b) Addition and multiplication are associative: 
(iF 22) +23 = 2 ae + 3) 
(z122)23 = 21 (2223) 


(c) There is an additive identity (0,0): 


z1 + (0,0) = z1 
(d) There is a multiplicative identity (1,0): 
z1(1,0) = % 


(e) Each element has an additive inverse: 


(x, y) aa (—x, —y) = (0, 0) 


(f) Each element other than (0,0) has a multiplicative inverse: 


sap ree) O 
(x,y x ey?” x* 4+ y* ( > 


(g) Multiplication distributes over addition: 


Z1(z22 + 23) = 7122 + 2123 


(1.3) 


(1.4) 


(1.5) 


(1.6) 


(1.7) 


(1.8) 


(1.9) 


Proof. All assertions (a)—(g) are direct consequences of (1.1) and (1.2), using only the 


field properties of the set R of real numbers. For example, (1.9) holds because 


Z1(z2 + 23) = (11,1) (2 + 3,2 + y3) 


= (x1 (x2 +.x3) — yity2 + y3),x10y2 + 3) + y1 x2 + 23) 
= (4X2 +.X1x3 — Y1V2 — Y1V3,X1 2 + X13 + WX2 + Y1x3) 


and 


2122 + 2123 = (%1,1)(2, y2) + (11,1) 03, y3) 


= (x1x2 — y1y2,x1y2 + 1X2) + (%1.x3 — y1y3, X13 + 1X3) 
= (x1x2 — yiy2 + 1x3 — y1y3,X1y2 + yix2 + x1y3 + y1%x3) 


which, by real algebra, is the same ordered pair. 


The reader should supply similar proofs for the remaining assertions. 


The symbol C is used for the field of complex numbers. 
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The x + iy Notation 


The symbol commonly used for a complex number is not (x, y) but x + iy. This notation 
goes back to Euler, who used i to denote J/=T in 1777, though the notation was first 
used consistently by Gauss. 

To recover this notation, we proceed as follows. First note that since 


(1,0) + (2,0) = (41 + X2, 0) 
(x1, 0)(x2, 0) = (4122, 0) 
we may identify a complex number (x), 0) with the real number x;. More pedantically 


the map (x1,0) +> x; defines an isomorphism between the set of complex numbers of 
the form (x1, 0) and the field R of real numbers. Now define 


i= (0, 1) 
Then 


x +iy = (x, 0) + (0, LG, 0) 
= (x,y) by (1.1) and (1.2) 


Finally, observe that 


i? = (0, 1)(0, 1) 
=(0-0—1-1,0-1+41-0) 
= (—1,0) 
=-] 


In this sense, we may say that i = ./—T. 

The x + iy notation is more convenient, and will be used from now on. (Sometimes 
we use x + yi instead. By (1.3) this represents the same number. In particular, we use 
this form when x, y are specific real numbers, because 1 + 2i looks more sensible than 
1+ 12.) 

Algebraic computations in this notation are easy. They use all the normal algebraic 
rules, plus the rule i? = —1. So to multiply, we work out 


(x1 + iy (x2 + ly2) = x1x2 + x1 1y2 + ty1x2 + iyi ty2 


= x4Xq +i(qy2 + x2) +P yy 


But i2 = —1, so this becomes 


x1xX2 — yiy2 +11 y2 + y1x2) 


This computation, of course, explains the choice of the multiplication formula (1.2). 
The addition formula (1.1) comes the same way but is easier. The definition by (1.1) 
and (1.2) is thus a very sneaky piece of hindsight. 

The formula (1.8) for inverses may also be derived as follows: 


1 1 x-iy_ x-iy 


x+iy xt+iyx-iy x+y¥ 
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i . 
in the form x + ty. 


2 
Example 1.3. Express i 
We have 


2h 2 Raa 26+ -4A+3). 8 oi 


1421 1+4+211-2i 5 D3 


A Geometric Interpretation 


Since ordered pairs (x, y) provide coordinates in the plane R*, we can visualise C as a 
plane, with the number x + iy corresponding to the point (x, y) as in Figure 1.1. 

The identification of (x,0) with x € R then amounts to considering the real numbers 
as forming the real axis in the plane, as in Figure 1.2. 

The y-axis, at right angles to this, is the imaginary axis. 

This geometric representation is often called the Argand diagram or the Gauss plane. 
Since so many other mathematicians (especially Wessel) have justifiable claims to it, we 
avoid the danger of giving undue credit to any of them by referring to it as the complex 
plane. In purely geometric terms, of course, it is just the real plane R?, but interpreted 
as C it has the additional algebraic structure of a field, not just a vector space over R. It 
is this extra structure that gives the complex plane its special qualities. 


Figure 1.1 Visualising a complex number as a point in the plane R?. 


Figure 1.2 Identifying x+ Oi e C withx ER. 


1.4 


1.5 


1.5 The Modulus 17 


Real and Imaginary Parts 


Given a complex number z = x + iy, we call x the real part of z and y the imaginary 
part, using the notation 


x = re(z) 
y = im(z) 


Both are real numbers: the coordinates of z in the complex plane. 


The Modulus 


The modulus, or absolute value, of a real number x is defined to be 


Be x ifx>0 
m=) x ifs <0 


As it stands, there is no obvious generalisation to complex numbers, because (see 
Section 1.8 below) there is no useful ordering on C. However, we can interpret |x| geo- 
metrically as the distance from x to the origin of the real number line. This translates 
directly to the complex plane, leading to the definition 


lel =? +y? 


for the modulus, or absolute value, of a complex number z = x + iy. Here we mean the 
positive square root: since x” + y* is always a positive real number, the formula defines 
|z| as a real number. 


THEOREM 1.4. The modulus has the following properties: 


Iz1 + za] < |zal + Iz2I (1.10) 
121Z2| = [z11Izal (1.11) 
[lz1| — lzall = lz1 — zal (1.12) 


Proof. Property (1.11) follows at once from the definitions. The triangle inequal- 
ity (1.10) is a little harder to prove directly, although its geometric interpretation 
(Figure 1.3) is the obvious fact that one side of a triangle is no longer than the sum 
of the lengths of the other two sides. 

To prove (1.10) note that, since both sides are positive, it is equivalent to 


(zi + zal)” < (zal + Izal)? 
which takes the form 


(x1 to +01 +y2) < lel? + 2ler|lzol + lel? 


where z] = x1 + 11, Z2 = %2 + iy2. Simplifying, this holds if and only if 


X1x2 + y1y2 XS |z1]|22| 
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Figure 1.3 Geometry for the triangle inequality. 


Since the right-hand side is positive, we may square again, and the desired inequality 
follows from 


(x2 +192) < lallel? 


But 


[zal7|zol* — (xix2 + yiya)? = xt + y)O5 + y3) — Gerx2 + y1y2)” 


= (x12 — x1)" 


which is positive. 
Property (1.12) is a consequence of (1.10). This implies that 


[21 — Z| + [Z| = lz1 
so 
lei = z2| 2 fe1| —'|z2| 
Swapping z; and z2 we also have 
zi — 22| = |z2 — zal = |z2| — Izl 


Combining the two inequalities yields 


llzi] — |z2|| < |z1 — zal 


1.6 The Complex Conjugate 
If z= x-+ iy, its complex conjugate is 
Z=x-ly 


Geometrically, this is obtained by reflecting z in the x-axis, Figure 1.4. 
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Figure 1.4 Geometry for the complex conjugate. 


The following properties are easy to verify directly: 


WZewa=H=Aa+Hu (1.13) 
fica = 2100 (1.14) 
re(z) = (2 +2) (1.15) 
im(z) = +(z —2) (1.16) 

|zl? = zz (1.17) 
Zz € Rif and only if z = z (1.18) 


Properties (1.13, 1.14) have the important implication that the complex conjugate of 
any polynomial expression in complex numbers z), 72, . . . , Zn can be obtained by writing 
a bar over each individual coefficient or variable in the expression. This is easily proved 
by induction. For example, 


52122 — 237 + 2iz = 5 —B! +2iz 
= -~7 i 
= 57122 —73° — 212] 


since 5, 2 are real and i is imaginary, so 5=5,2=2i=-i. 


Polar Coordinates 


The expression x++iy for a complex number is intimately related to Cartesian coordinates 
(x,y) in the plane. It turns out often to be useful to work with polar coordinates (r, 0), 
which we recall correspond to a point distance r from the origin making an angle 0 
measured from the positive x-axis in an anticlockwise direction, Figure 1.5. Of course 
we measure 6 in radians. These coordinate systems are related as follows: 


x rcos@ 
y = rsind 


(1.19) 
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Figure 1.5 Polar coordinates. 


Therefore 


r= fe+y=t 


Finding @ is slightly trickier because it is not unique. Any value of 6 for which (1.19) 


where z = x + ly. 


holds is called an argument of z. The article ‘an’ is used to reflect the lack of uniqueness: 
if 9 is an argument then so is 6 + 2k for any integer k. With the understanding that 0 
is unique only up to multiples of 277, we may use the notation 


0 = argz 


Often the choice of 6 is rendered unique by imposing some convention: for example, 
we may insist that 0 is chosen in the interval [0, 277), or in (—7, 2]. The unique value of 
@ in the interval (—z, 7] is known as the principal value of the argument. (We follow 
standard practice in taking this particular interval. Its main advantage is that @ then 
behaves nicely near the positive real axis, where 0 = 0. But this is a technical point that 
only acquires importance much later. The non-uniqueness of 6 is a phenomenon with 
tremendous ramifications in the theory, as we shall see.) 
With r, 0 defined as above, 


zZ=x+iy =r(cos@é + ising) 


The expression cos @ + isin@ is of considerable importance in complex analysis. In 
Chapter 5 we relate it to the complex exponential function. 


The Complex Numbers Cannot be Ordered 
The real numbers may be given an ordering (the usual one, >) which has among its 
properties the following: 


If x # 0 then either x > O or — x > 0, but not both (1.20) 
Ifx,y > Othenx+y>0,xy >0 (1.21) 
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No such ordering can be defined on the complex numbers. Suppose for a contradiction 
that one can. Since i 4 0, (1.20) implies that either i > 0 or —i > 0. Then (1.21) implies 
that either —1 = 1-1 > 0 or —1 = (—i): (i) > O. At the same time, | = (iy > 0. 
But then both 1 and —1 are greater than 0, contrary to (1.20). 

It is therefore not possible to use inequalities, analogous to those for reals, when 
discussing complex numbers. Any inequality that occurs must involve only real 
numbers, possibly related to the given complex numbers. For example, if z € C then 


z>1 
makes no sense, but either of 
lz| > 1 
or 
re(z) > 1 


is acceptable. (They do not mean the same thing!) As a convention, if we write a 
statement such as 


e>O0 


this will automatically imply that ¢ is assumed to be a real number. 


Exercises 


1. Check in full detail that the complex numbers C form a field under the operations 
of addition and multiplication defined in (1.1, 1.2). 


2. In Figure 1.6 the black dots represent three complex numbers u,v, w (as marked). 
The circle is the unit circle |z| = 1. The open dots a, b, c, d,e,f, g, h represent (in 
some order) the numbers u+ v,u-+w,v+w,u+v-+w, uv, uw, vw, uvw. Which is 
which? 


Figure 1.6 Data for Exercise 2. 
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3. By writing z in the form z = a+ Mi, find all solutions z of the following equations: 
Gi) 22 = —5 +121 
(ii) 22 =2+i 
(iii) (7 + 241)z = 375 
(iv) 22 -(3+iz+(2+ 21) =0 
(v) 2 -3z+1+i=0 


4. If 4 is a positive real number, show that 


{ze C: |z] =Alz—1]} 
is a circle, unless d takes one particular value (which?) 
5. Draw the set of points 
{ze C: re(z +1) = |z— 1}} 


by substituting z = x + iy and computing the real equation relating x and y. 


Now note that re(z+ 1) is the distance from z to the line y = —1, and |z— 1| is the 
distance between z and 1. Compare with the classical ‘focus-directrix’ definition of 
a parabola: the locus of a point equidistant from a fixed line (here y = —1) anda 


fixed point (here (x, y) = (1, 0)). 
6. Draw the set of all z € C satisfying the following conditions: 

(i) re(z) > 2 
(ii) 1 < im(z) < 2 

Gi) 1 <im(@—i) <2 

(iv) |z| <2 
(v) |z| >1 

(vi) 1 < |z| <2 

(vii) |Iz— 1] < 1 

(viii) |z— 1] < |z+ 1 


7. Draw the set of all z € C satisfying the following conditions: 


(i) z=1 

(ii) z+iZ+1+i=0 
(iii) z+ Z+2=0 
(iv) z+z+21=0 


8. Let r,s, 6, be real. Let 


z= r(cos@ + isin@) 


w = s(cos@ + ising) 


Form the product zw and use the standard formulas for cos(9 + @), sin(@ + @) to 
show that arg(zw) = arg(z) + arg(w) (for any values of arg on the right, and some 
value of arg on the left). 

By induction on n, derive De Moivre’s Theorem 


(cos6 +isin6)” = cosnd +isinnd 


for all natural numbers n. 
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Figure 1.7 Data for Exercise 10. 


10. 


11. 


12. 


13. 
14. 


Specialise to the case n = 3 and recover the usual formulas for cos 36 and sin 30 
in terms of cos@ and siné@. 


. Use De Moivre’s Theorem (Exercise 8) and the substitution z = r(cos 6 +isin@) to 


show that the equation z> = I has three distinct complex roots. Find them. 
Compute the square roots of 1 + i./3, /3 — i, and 1 + i, and the cube roots of 
J/3 +i, 1 —i, i. Sketch these points in the complex plane. 


In earlier textbooks, multiplication of complex numbers is often defined as follows. 
Given two complex numbers Z) , Z2, represent them by points A and B in the complex 
plane; and let O, U be the points z = 0, 1 respectively, Figure 1.7. 

Draw triangle OBC similar to triangle OUA (where ZBOC = ZUOA, ZOBC = 
ZOUA). Then z1Z2 is represented by the point C so constructed. 

Using the fact that |z1z2| = |z1||z2|, and the result of Exercise 5, show that this 
construction agrees with our definition (1.2). 


Define a square root ./z of a complex number z to be any complex number w such 
that w2 = z. Prove that every non-zero complex number has exactly two square 
roots, and give formulas for them in terms of re z and imz. 

If a,b,c € C with a ¥ 0, show that the solutions of the quadratic equation 


av’ +bz+c=0 


are precisely 
—b+ Jb? — 4ac 
= 2a 


Use De Moivre’s Theorem (Exercise 8) to compute cos5@ and sin 50 in terms of 


cos 6 and sin@. 
Prove that De Moivre’s Theorem remains true if 7 is a negative integer. 


Define a kth root 4/Z to be any w such that w* = z. Use De Moivre’s Theorem to 
find an expression for </r(cos 6 + isin). 
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In this chapter we collect together the basic topological ideas required for our study of 
complex analysis. The list is not very demanding. Some items are needed to handle dif- 
ferentiation neatly, and some are needed for integration. Differentiation is naturally set 
against a background of limits and continuity, and these are best dealt with on open sets. 
On the other hand, an interval from one complex number to another is computed along 
a specified path between them. A set in which any two points can be joined by a path is 
said to be path-connected. To be able to cope with both integration and differentiation 
in the simplest possible manner later on, we restrict our complex functions to be those 
defined on open path-connected sets. Such a set is called a domain. 

When the set is open we often abbreviate ‘path-connected’ to ‘connected’. The term 
‘connected’ is used in point-set topology with a specific technical meaning, but for 
open sets in C it is equivalent to being path-connected. So this abbreviation does no 
harm. 

Domains can have exotic shapes and paths can wiggle around a great deal. To be 
able to appeal to geometric intuition without our imagination having to work over- 
time thinking about such complications, we use a carefully conceived technical device 
called the Paving Lemma. We show in this lemma that a path in an open set (in par- 
ticular, a domain) can be subdivided into a finite number of smaller pieces in such 
a way that each piece is contained in a disc within the open set, thus ‘paving’ the 
path with discs, see Figure 2.1. A disc is the interior of a circle, which is geomet- 
rically very simple: for instance, any two points in it can be joined by a straight 
line. Joining the end points of pieces of the original path in each disc paving it, we 
obtain a new path made up of straight line segments, still lying in the open set and 
joining the ends of the original path. We see, therefore, that given any path what- 
soever between two points in an open set, no matter how much the path twists and 
turns, there is an alternative path in the open set, between the same points, that is 
made up of a finite number of straight line segments. We can even insist that the seg- 
ments are parallel to the real or imaginary axis, giving a step path in the open set. To 
do so, take a suitable step path inside each paving disc and join them together, see 
Figure 2.2. 

With techniques such as this we can use the Paving Lemma to illuminate complex 
analysis, yielding fully rigorous proofs linked firmly to geometric intuition. 


Topology of the Complex Plane 25 


Figure 2.1 A domain (shaded) and a path in the domain (solid curve). Circles show a finite set of 
discs inside the domain, whose union contains the path. 


Figure 2.2 Replacing the path in Figure 2.1 by a step path. 
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2.1 Open and Closed Sets 


DEFINITION 2.1. For a complex number zo and a positive real number ¢, the 
e-neighbourhood of zo is 
Ne (Zo) = {2 € C: |Z — zol < €} 


This is an open disc of radius «. 
Geometrically, N;(z) is the disc centre Zo of radius ¢, Figure 2.3. 
A subset S C C is said to be open if for every zo € S there is a real number € such 


that N;(zo) C S. We emphasise that e may depend on zo. 


Example 2.2. The disc N;(zo) is itself open, for if z} € Ne(zo) then |z; — zo| < é. 
Choose 6 > 0 such that 6 < € — |z — Z|. By the triangle inequality, Ns(z1) C N, (Zo), 
Figure 2.4. 


DEFINITION 2.3. The complement of a subset S C C is 
C\S={zeC:z¢€S} 
A subset S is closed if C \ Sis open. 


Figure 2.3 The e-neighbourhood of a point zo. 


Figure 2.4 The e-neighbourhood of zp is open. 
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There is another way to characterise closed sets, using the notion of a limit point of 
a subset S. A complex number Zo is a limit point of S if every neighbourhood N,(zo) 
contains a point of S not equal to zo. In this definition, z does not itself have to belong 
to S, though it may do. The essential feature of a limit point of Sis that it has points of 
S arbitrarily close to it. In fact, each N,(zo) must contain an infinite number of points of 
S — for if some N,(z) contains only finitely many points z),...,z, of S, distinct from 
zo, we can take e€; to be the smallest of the distances |zo — z,|. Then Ne, (zo) contains no 
points of S, a contradiction. 

An alternative characterisation of a closed set is: 


PROPOSITION 2.4. A subset S € C is closed if and only if S contains all its limit 
points. 


Proof. Suppose S is closed and zo is a limit point. If z9 € C \ S, which is open, then 
Ne (zo) € C \ S for some ¢ > 0, so N; (zo) contains no point of §, contradicting zo being 
a limit point. Hence Zp € S. 

Conversely, suppose that S contains all its limit points. Then any zo € C \ Sis nota 
limit point of S, so there exists € > 0 such that N,(zo) contains no point of S. Therefore 
Ne (zo) € C \ S, so C \ Sis open, and S is closed. 


Not every point of a closed set need be a limit point of that set. For instance, if 
T= {z€ C:z=0orz=1/n fora positive integer n} 


then the only limit point of T is 0. Since this is in T, the set T is closed. 

Points in S that are not limit points of S are said to be isolated points of S. All points 
of T except 0 are isolated points of T. 

In general, an isolated point zg of Shas a neighbourhood N,(zo) that contains no points 
of S other than zo. For instance, in the set 7, if ¢ = I — Ani then V,(1/n) contains no 
other points of 7. On the other hand, it is clear from the definitions that every element 
of an open set is a limit point of that set. 


We will also need: 


DEFINITION2.5. The closure of asubset S € C is the intersection of all closed subsets 
that contain S. This is closed, and is the smallest closed subset containing S. It consists 
of S together with all limit points of S. 


Limits of Functions 


The notion of a limit 
lim f(z) 
Z—> 20 
is analogous to the real case, and its properties follow by similar arguments. 


DEFINITION 2.6. If f : S — C is an arbitrary complex function and z is a limit point 
of S, then lim,_,,, f(z) = / if, given any ¢ > O, there exists 6 > 0 such that 


for all z € S,0 < |z—o| < 6 implies | f(z) —1| < « (2.1) 
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Two points should be made about this definition: 

(a) The point zo need not belong to S, so f(z) need not be defined. Even if zo € S, 
Ff (Zo) may not equal I. 

For example, if 


0 @#0) 


f= | 1 @=0) 


then lim,_,9 f(z) = 0 4 f(0) 

(b) It is essential that zo is a limit point of S, or else there would exist 6 > O such that 
{z: |z—zo| < 6} contains no point of S. In this case condition (2.1) would be vacuously 
true for any 1 € C. 

As in the real case, we have: 


PROPOSITION 2.7. If zo is a limit point of S and lim, z, f(z) = |, the limit is unique. 


Proof. Suppose that /’ ~ 1 is another candidate for the limit. Take « = 
61 > 0,62 > O such that 


5|1 —I'| to find 


z€S,0 < |z— Z| implies | f(z) —1| < € 


z€ 8,0 < |z—zo| implies | f(z) —I'| < € 


Because Z is a limit point, there exists z* € S, where 0 < |z — z*| < min(6), 52). Then 


2] = fe) +F@")- P| 
Sl=fE 4G) =0) 
<E+E 


=2e 


contradicting the choice of e. 


Standard properties of complex limits may be proved using methods analogous to the 
real case: 


PROPOSITION 2.8. /flimz+~ f(z) = 1, limz-.~ 9(z) = k, then 


Q) lim, + f@) + g@) =!I+k 
Gi) limz+z) f(z) — g@) =I—k 
ii) lim.) f(@g(2) = Ik 
(iv) lim, f(@/g@ =Wk (forl £0) 
Proof. Part (i) is routine. Part (ii) follows immediately if we first show that 
limz-+z(—g9(z)) = —k, which is trivial. 


Part (iii) is a little trickier. Bear in mind that f(zo), g(zo) may not be defined since zo 
need not belong to S. Write 


| F@)g(Z) — Tk] = |F@)g — Ig) + Ig(z) — Tk (2.2) 
< If@ — Mlg@)| + lle) — &l (2.3) 
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Since lim,_,,, 9(z) = k, there exists 69 > 0 such that 
z€Sand0 <|z—zo| <6 implies |g(z) —k| < € 


Therefore |g(z)| < |k| +e = M, say, whenever 0 < |z— zo|. Therefore |g(z)| is bounded 
above near (but not necessarily at) zo by M. 
Since lim,_,,, f(z) = I, there exists 6, > O such that 


z€ Sand0 < |z— zo| < 6; implies | f(z) —1| < ¢/2M) 
Let N € Rsuch that N > |/|, so in particular N > 0. Since lim,-,,, g(z) = k, there exists 
62 > 0 such that 

z€ Sand0 < |z—zo| < 62 implies |g(z) — k| < e/(QN) 
For 6 = min(é9, 61, 52) we then have 


z€ Sand 0 < |z—zo| < 5, implies 


é 


é 
If — Mg@l+ llg@ —kl < saM+ Now =e 


By (2.3), for z € S and 0 < |z— zo| < 62, we have 
| f(z)g(z) — Tk| < € 


proving (iii). 

To prove (iv), it is enough to prove that lim,_,., 1/g(z) = 1/k, and then to appeal to 
(iii) for the functions f, 1/g. 

For this to make sense we must show that for some 6; > 0, g(z) is not zero when 
z€Sand0 < |z— zo| < 61. Then 1/g(z) is defined. 

This follows since lim... g(z) = k and k # 0. Indeed, there exists 5, such that 


z€ Sand 0 < |z—zo| <6; implies |g(z) — k| < FIkI 
which in turn tells us that 
z€Sand0 < |z— Z| < 6; _ implies |g(z)| > ala 
We now find 52 such that 
ze Sand 0 < |z—zol < 52 implies |g(z) — k| < 5|kl’e 
Now, if 6 = min(6q, 42), 
ze Sand 0 < |z—zo9| < 6 implies 


1 1 lk — g(z)| 
a(z)  k | g(z)k| 


The real and imaginary parts of a complex function, 
f(@) = ref(z) + iim f(z) 
may be considered separately. If 


lim fi) =l=a+iB (a,B €R) (2.4) 
Z—>Z0 


< Zlki?e/(Glkl?) = € 


30 


2.3 


Topology of the Complex Plane 


then, because 
|re f(z) — a| = |re(f(z) — DI < (f@ — DI 
we deduce from the definition that 
lim ref(z) =a@ (2.5) 
Z—>Z0 
Proposition 2.8 (ii) now implies that 
lim im f(z) = B (2.6) 
ZxH 


Conversely, if (2.5,2.6) both hold, Proposition 2.8 (i) implies (2.5). 
We can rephrase the preceding argument by recalling that if § C R? the limit of a real 
function ¢@ : S > R of two variables, @(x, y) for (x, y) € Sis defined as follows: 


d(x y) > Aas (x, y) > (a,b) if, given € > 0, there exists 5 > O such that 


for all (x, y) € S,0 < Je — a) + (y—b)* < 6 implies |@(x, y) — Al < € 
Identifying (x, y) with z = x + iy € C, and setting zo = a + ib, this may be written as 
z€Sand0 <|z—z| <6 implies |¢(@, y) —A| <« 
If we now write 
F@) = ue, y) + iv, y) 


where the real and imaginary parts of f are considered as real functions u and v of two 
real variables x, y, we have proved: 


PROPOSITION 2.9. Let z = a-+ib. Then lim,_,~ f(z) =1= a + iB if and only if 


u(x,y) > a@ v(x,y) > B as(x,y) > (a,b) 


In this way, the limit of a complex functions is equivalent to the limit of a pair of real 
functions of two real variables. However, the notation in the complex case is generally 
simpler. 


Continuity 


The definition of continuity for a complex function mimics that for a real function, using 
the complex version of the modulus: 


DEFINITION 2.10. A function f : S — Cis continuous at zo € S if, given e > 0, there 
exists 6 > 0 such that 


for all z € S, |z— zo| < 6 implies | f(z) — f(zo)| < € 


A function is continuous if it is continuous at every point zo € S. 
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If zo is a limit point of S, this is equivalent to saying that lim, ., f(z) exists and 
lim f(z) = fo) 
£20 


If zg is an isolated point of S then there is a neighbourhood N5(zo) that contains no 
other points of S apart from zo, so 


for all z € S,|z — zo| < 6 implies z = zg 


which in turn implies 
lf@) —f@o)| = 0 


So a complex function is always continuous at an isolated point, according to the defi- 
nition. Actually, this will not be an issue for us, because we will consider only functions 
where all points of S are limit points. In fact, S is usually open. But it seemed worth 
tidying up a potential loose end. 

We can rephrase the definition of continuity in terms of open discs: f is continuous at 
zo € Sif, given € > O, there exists 5 > O such that 


for all z € S,z € Ns(z) implies f(z) € Ne Cf (z0)) 


Or, more succinctly, 
f(Ns(Z0)) © Ne(f(Z0)) 


We can develop an alternative way to define continuous functions in terms of open 
sets. First we need a generalisation: a subset V C S is said to be relatively open in S, or 
just open in S for short, if for every zo € V there exists 0 > 0 such that No(zo) NS C V. 


Example 2.11. A relatively open set need not be open. The interval (a,b) = {x ER: 
a < x < b} is open in R, but not in C. In fact, when S = C let x € (a,b) and 
suppose that o > 0. Then x + io/2 € N,Q) = No(x%) NS, but x + io/2 ¢ V. See 
Figure 2.5. 


Figure 2.5 The interval (a, b) is open in R but not in C. 
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Using the standard set-theoretic notation 
-1 
f° WU) = {ze S: f(z) € US 
we get an alternative characterisation of a continuous function: 


PROPOSITION 2.12. A complex function f : S — C is continuous if and only if, for 
every open set U in C, the set f—'(U) is open in S. 


Proof. Suppose that f is continuous and U is open. Let zo € f~!(U). Then f(z) € U so 
there exists € > 0 such that N-(f(zo)) C U. By continuity of f there exists 6 > 0 such 
that 


F(Ns(Z0) NS) © Ne(f (Zo) S U 
Hence 
Ns(zo) NS C f-'(U) 


and f—!(U) is open. 

Conversely, suppose that f~!(U) is open in S for every open set U. Given zo € S and 
€ > 0, the set N-(f(zo)) is open, so ff! (Ne(f(z0))) is open in S and there exists 6 > 0 
such that 


N35(z0) NS Cf! (Ne( F(z0))) 
Hence 


F(N5(Z0) VS) S Ne(f(Zo)) 


so f is continuous. 


Proposition 2.12 is favoured by topologists as the definition of continuity. It is partic- 
ularly useful when S is itself an open set. Then, given a subset V C S that is open in S, 
and any zo € V, there exist o,€ > O such that 


No(zo) GV (because V is open in S) 


Ne(zo) CS (because S is open) 
Taking 6 = min(e, 0), we deduce that 
Ns(Zo) SV 
so V is an open set in C. This proves: 


COROLLARY 2.13. If S € C is open, then f : S — C is continuous if and only if, for 
every open set U, the inverse image f~\(U) is open. 


Figure 2.6 illustrates this result. 
Composition of two functions is defined in the usual manner: if f : S — C and 
g :T — C where f(S) € T, then g of : S > C is given by 


gof(z) = a(f(2)) 
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Figure 2.6 Definition of continuity when S is open in C. 


PROPOSITION 2.14. Iff is continuous at z € S and g is continuous at f (Zo), then gof 
is continuous at Zz. 


Proof. Easy exercise. 


Addition, subtraction, multiplication, and division of complex functions fj : S > C 
and f) : S > C are defined by 


fitf,:S— C where (fi +f =A@+hz) (€S) 
fi —fp :S > C where (f; —f)@) =A@—-hi) «é€S) 
fifa :S — C where (fif2)(2) =filafoa(z) (z €S) 
f/f, :S > C where (f1/fo(z) =fAl@/faz) (ze S’) 
Here S = {z € S : fo(z) ZO}. 


Warning: the composition g o f is often abbreviated to gf when it is clear that the 
product is not intended. 


PROPOSITION 2.15. If f, and fz are continuous, then so are f, +f2,f1 — fr, fifo, and 


fi/f2 (when fr(z) # 0). 


Proof. This is a direct consequence of Proposition 2.8. 


This result lets us show very quickly that certain functions built up from functions 
known to be continuous are themselves continuous. For instance, the constant function 
k(z) = c (where c € C) and the identity function /(z) = z are clearly continuous. We 
give the easy proofs. Let ¢ > 0. For k take any 5 > 0; then 


lz — zo| < 6 implies |k(z) — k(zo)| = |c -c] =0 <e 
For J take 6 = e€; then 


lz — Zo| < € implies |7(z) — I(zo)| = |z — zo] < € 
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On that basis, Proposition 2.15 shows immediately that f(z) = cz is continuous. By 
induction on n, it follows that f(z) = cz" is continuous for anyn € N. A further induction 
proves that any polynomial function 


f (2) = Anz” + an_1z” | +--+ +9 


is continuous. Then any rational function p(z)/q(z), where p,q are polynomials, is con- 
tinuous at any point zo where q(zo) ~ 0. Using these ideas we seldom have to perform 
any intricate epsilon-delta calculations. 


Example 2.16. The modulus function m(z) = |z| is continuous for all z. Given « > 0, 
take 6 = e. Then 


|m(z) — m(zo)| < € implies ||z| — |zol] < |z— zol < € 


using (1.12). 


2 3 , “ 7 : ‘ 
Example 2.17. f(z) = [eff Vie +1066. is continuous for z #4 —1. Proving this with an 
explicit choice of 6 for given € is tedious and messy. A quicker way is to observe that 
|z|? = |z||z| and is therefore continuous, 17z? + 1066z is a polynomial and so is also 


continuous, and the same goes for z + 1. Since we are dividing by this, we must have 


z#-1. 


Writing f(z) = u(x, y)+iv(x, y) for z = x+iy € S, where u, v are real-valued functions 
of the real variables x, y, Proposition 2.9 gives: 


PROPOSITION 2.18. The function f(z) = u(x, y)+iv(a, y) is continuous at zo = xo +ivo 
if and only if u,v are continuous at (xo, yo). 


Example 2.19. If f(z) = 2”, then f(z) = (x+iy)* = x?—y*+ 2ixy. Hence u(x, y) = x?—-y? 
and v(x, y) = 2xy. The function f is continuous for all z € C, just as u,v are continuous 
functions of x, y for all (x, y) € R?. 


An interesting case occurs when S is the real interval 
S=[a,b]={xeR:a<x<b} 


considered as a subset of C. Here z = x + Oi, so we can simplify notation and write 
f(z) = u(x) + iv(x). Now the function f : [a,b] — C is continuous if and only if both u 
and v are continuous. 


Example 2.20. f : [0,1] — C, where f(x) = x? +ix? is continuous, since both u(x) = x? 
and v(x) = x are continuous. 


Functions defined on a real interval play a central role because they define paths, as 
we now discuss. 
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Paths 


In the early development of complex analysis, mathematicians worked out how to inte- 
grate complex functions along paths in C, without worrying too much about the meaning 
of ‘path’. This happened back in the days when notions of continuity and limits were 
still under development. The intuitive idea of a path (or curve) was something that could 
be drawn by moving a pencil (with an infinitely fine point) without making any jumps. 
Such paths were assumed to have various nice properties, such as having a continuously 
varying tangent, or not crossing themselves. These properties were not made explicit 
until the foundational issues in analysis were sorted out, in particular by Bolzano and 
Weierstrass. 

Later it turned out that some of these assumptions are not necessary to develop a 
satisfactory theory of integration, while others must be made precise to avoid running 
into contradictions. We have no wish to make readers repeat all the twists and turns of 
history; as we have said, some battles have already been won. But from time to time 
we find it useful to point out some of the pitfalls that can arise, and to emphasise some 
subtle distinctions — for example, the definition of a path as a map y from a real interval 
[a, b] into the complex plane, and the distinction between this map and its image. Our 
pictures generally show the image, leaving the reader to infer the map. We often add an 
arrow as a reminder that as a parameter f runs through the interval [a, b], the point y (7) 
moves along the image in a specific direction. 

One of the pleasant features of complex analysis is that fairly often that information 
is all we need. At various points in the book we also find it useful to impose further 
conditions on paths, to ensure that they have appropriate properties. 

The upshot of more than a century of deep contemplation and debate is the following 
definition: 


DEFINITION 2.21. A path in the complex plane is a continuous function 


y:[a4b] >-C (a<beR) 


Its initial point is y(a) and its final point (or end point) is y(b). 


Sometimes we speak of y as ‘a path in C from z; to z2’ when z; = y(a) is the initial 
point and z) = y(b) is the final point. When ¢ € [a, b] we also refer to z = y(t) asa 
‘point on the path y’, although strictly speaking z is on the image of the function y. If 
we think of ¢ as ‘time’ and imagine f increasing from a to b, the point y(f) traces a curve 
in the plane from y(a) to y(b). When drawing a diagram we often indicate the direction 
of motion (increasing ft) by an arrow, as in Figure 2.7. However, it must be emphasised 
that this is a makeshift device since the curve may cross itself or turn back on itself, so 
the picture may get quite complicated. We pick up this point again in Section 6.7. 


Standard Paths 


To simplify matters, when we speak of line segments and circles, considered as paths, 
we assume they are specified by the following standard functions: 
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Figure 2.7 A path in C, showing its parametrisation by t € [a, b]. 


(i) The line segment L = [z1, z2] from z1 to z2 is 
La =(1—-d)atte (te [0,1)) 
When z; = a, z2 = b € R then the image of the path is the closed interval [a, b] C 


RCC. 
(ii) The unit circle C: 


Ci) =cost+isint (ft € [0,27]) 
(iii) The circle S with centre z and radius r > 0: 
S(t) =zo t+ r(cost+isint) (¢ € [0, 27]) 


See Figure 2.8. 


Figure 2.8 Three standard paths. 
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Figure 2.9 ‘Graphs’ of paths considered as functions [a, b] > C. Left: A path that travels once 
along the image in a fixed direction. Right: A path that travels once along the image, turns round 
and travels back, then turns again and travels along the image for a third time. 


Visualising Paths 


We are used to visualising a function f : R — R using the corresponding graph y = f(x) 
in the plane. A similar idea can be exploited to visualise paths in a more explicit way. 
Figure 2.9 shows analogues of graphs for two maps y : [a,b] — C. The arrows join 
representative points t € [a,b] to their images y(t) € C. The left-hand figure illustrates 
a path where y(t) travels along the image without turning back on itself. The right-hand 
figure illustrates a path where y(t) travels along the image to the far end, turns back on 
itself to revisit the start, then turns back again to travel along the image for a third time. 
Although paths like this are seldom encountered in practice, they are allowed by the 
definition, so we have to take that possibility into account. Example 6.17 below shows 
that simple formulas can produce this type of behaviour. 

An alternative would be to change the definition to rule out such behaviour, but most 
of the theory works in the general case with no extra effort, so it turns out to be simpler 
to leave the definition as it stands, while bearing in mind that simple pictures might be 
misleading in some respects. 


The Image of a Path 


To emphasise the distinction between a path and its image we introduce a further 
definition: 


DEFINITION 2.22. A curve C from z to z2 in the complex plane is a subset of C that 
is equal to the image of a path y : [a, b] — C, where y(a) = z; and y(b) = z. 

Given a curve C, a parametrisation of C is a path o : [c,d] — C such that C is the 
image of o, where o(a) = z and o(b) = 22. 

The corresponding parameter is a variable t € [a, b]. 


We sometimes call [a, b] the parametric interval of o. 


38 


2.5 


Topology of the Complex Plane 


1 +i 


Figure 2.10 The two distinct paths y; , v2 have the same image, so they define the same curve. 


If we think of ft as ‘time’, and imagine a particular parametrisation y : [a,b] > C, 
then as ¢ increases froma to b the point y() traces the corresponding curve in the plane 
from y (a) to y(b), moving continuously with ¢. 


Example 2.23. The same curve may be traced by many different paths. For example, in 
Figure 2.10 the paths 


ni) =2t+it) O<t< 5) 
yt) =t? +i? (0<t<1) 


traverse the same curve 


{xtiye C:x=y,0<x< 1} 


Change of Parameter 


Definition 2.21 equips every path y : [a,b] > C with a parameter t € [a,b]. It is often 
convenient to change the parameter and its parametric interval, because suitable choices 
can simplify calculations and proofs. In this section we discuss such changes. 

We begin with Example 2.23, which distinguishes a path from its image by exhibiting 
two different paths that have the same image curve. Dynamically, these two paths trace 
the same curve in the same direction, but at different speeds. They are related by the 
change of parameter p : [0, sl — [0,1], where p(t) = 1. This has an inverse map 
p~! : [0,1] — [0, 4] given by p~!(s) = /2s. 

We now discuss general changes of parameter, defined as follows: 


DEFINITION 2.24. Let y : [a,b] — C bea path, and suppose that p : [c,d] — [a,b] 
is continuous and satisfies 


pc)=a p(idy=b 
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so in particular p is onto [a, b]. Then the composition y o p : [c,d] > C is also a path. 
We call y o p a reparametrisation of y. 
Note that the parametric interval changes when [c,d] £ [a,b]. 


Preserving Direction 


If we impose extra conditions on p, a change of parameter can preserve extra properties. 
The image of the path is important, but another property is also vital: the direction in 
which the path is traced. Now our mental image involves the dynamics of a point moving 
along the path, as well as the image that it traces out. 

At this stage we are working with general continuous paths, so we adopt the fol- 
lowing approach. Recall that a map o : [a,b] — [c,d] is strictly increasing if 
a <t, < t) < bimplies o(t;) < o(t,). Both of the above maps p and p~! are strictly 
increasing. 

In real analysis it is proved that any strictly increasing continuous map has a strictly 
increasing continuous inverse. We give the easy proof for completeness: 


LEMMA 2.25. If p : [a,b] — [c, d] is continuous and strictly increasing with p(a) = c 
and p(b) = d, then p has a strictly increasing continuous inverse a : [c,d] —> [a, b]. 


Proof. If s € [c,d] then p(a) < s < p(b). By the Mean Value Theorem there exists 
t € [a,b] such that p(t) = s. This determines a strictly increasing inverse function 
po! : [c,d] — [a,b], where p and oe! both map open intervals to open intervals, so 
p' is also continuous. 


We can now state: 


DEFINITION 2.26. Let C be a curve, parametrised by two maps y : [a,b] — C and 
i : [c,d] — C. The maps have the same direction if there is a strictly increasing function 
p : [a,b] — [c,d] such that 


y=Aop 


Then p is a direction preserving change of parameter. 


If the change in parameter p is not increasing, it can lead to part of the curve being 
traced back and forth as ¢ increases. In Figure 2.11, as f increases from p to qg, the value 
of o(t) increases, then decreases, then increases again. As this happens, the points on 
the curve from P to Q are traversed first in the direction from P to Q, then back to P, 
then from P to Q once more. The curve is the same, but the distance travelled increases. 
This means that when we calculate the length of a curve, defined in Section 6.3, we must 
prescribe how the curve is traced. 


Subpaths and Sums of Paths 


In the general theory, and in applications, it is useful to be able to chop paths into pieces, 
or join pieces together. 
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Figure 2.11 Left: A change in parameter. Right: If p has a graph like this then the parameter ¢ 


changes direction along the image curve. 


DEFINITION 2.27. For an arbitrary path y : [a,b] — C, a subpath is obtained by 


restricting y to a subinterval [c,d], where a< c <d <b. If 


A=Xx% <x, <-:-<x,=) 


and y; is the subpath obtained by restricting y to [x;-1,x,], we write 


Y= + Yn 


In so doing, we think of y as being made up by tracing along the paths y,,.. 


in order, see Figure 2.12. 


eeee oe ee 


Figure 2.12 Decomposing a path into subpaths. 


-> MY, taken 


Similarly, if the final point of a path y; coincides with the initial point of a path y2, 
it is useful on occasion to create a combined path by first tracing y; and then y2. There 
is a minor technicality here, because y;(b) can be the same as y2(c) even though yy is 
defined on [a,b] and 7» defined on [c,d] where b 4 c. The next example shows how to 


get round this difficulty. 
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Figure 2.13 Joining two paths by shifting parameters. 


Example 2.28. Suppose that 


yig=t (te [0,1]) 
y2(t) = cost+isint (t € [0,z]) 


as in Figure 2.13. Here y;(1) = y(0) but 1 4 0. 


In such a case we extend our notation a little. Suppose that y; : [a,b] > C, and 
v2: [c,d] > C, with y(b) = y2(c). Then we define the combination y = yj + y2: 
[a,b+d—c]—> Cby 


_ fu (t € [a, b]) 
r= | ywt+e-b) (te[b,b+d-cl) 


In effect, what we have done is to shift the parametric interval of the second path from 
[c,d] to [b,b +d — c], by adding b — c to all points in [c, d]. 
In Example 2.28, for instance, 


pe t (t € [0, 1]) 
YO) eee Cine) Geld ep 


and y consists of the line segment 7, followed by the semicircle 72. 

The path y1 + yz is defined only when the final point of 7; is the same as the initial 
point of y2, so perhaps this is not a fully appropriate use of the + sign. However, we do 
have 


Mtwtw=aN+t2+yH) (2.7) 


whenever the appropriate end points coincide, so we can omit the parentheses in such a 
‘sum’, and more generally write y; + ----+ yp, to indicate the path obtained by succes- 
sively tracing yj, ..., Yj, whenever the final point of each y,_, coincides with the initial 
point of y(1 <r<n). 
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Equation (2.7) states that addition of paths is associative, but in general it is not com- 
mutative. Exercises 7 and 8 provide simple examples to show that when y; + y2 is 
defined, y2 + yi may not be, and even when it is, y1 + y2 need not equal y2 + 1. 

For a path y : [a,b] > C, another useful concept is the opposite path —y : [a, b] > 
C defined by 


—y@=yat+b— (te [a,d)) 


As t increases from a to b, so —y describes the same curve as y, but it does so in the 
reverse direction. If y is a path from z, to z2, then —y is a path from 22 to z). 


Example 2.29. Suppose that L is the line segment [z), z2] given by 
K)=(1 —pat+itze (te [0,1] 
Then —L is given by 
Lt) = t+ (1—2t)z (¢€ [0,1)) 


which is, of course, [z2, z |. 
If S is the circle centre zp and radius r > 0, then 


S(t) = r(cost+isint) (t € [0,27]) 
describes the circle once, anticlockwise, while the opposite path 


—S(t) = zo + r(cos(2m — ft) +isin(2w —1t)) (t € [0,27]) 
=z +r(cost—isint) (t € [0,27]) 


describes it once, clockwise. 


In a sum such as y; + (—y2) + 73 we omit the parentheses and write y, — y2+ y3. 
This is the path that traces first along y;, then back along the opposite path to y, 
and then along y3. As always, the appropriate final and initial points must agree. See 
Figure 2.14. 


Figure 2.14 The path y; — yo + 3. 
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Figure 2.15 The path Cr + Ly — Ce +L. 


Example 2.30. Suppose that 0 < « < Rand 
Cr(t) = R(cost+isint) (t € [0,7]) 
C;(f) = e(cost+isint) (t €[0,7]) 
Lii=t (t € [—R,—-e]) 
Inf)=t (t € [e, R]) 


Then Cr +L; — C; +2 is the path describing the curve in Figure 2.15 once, anticlock- 


wise. 


The Paving Lemma 


We now come to a technical lemma that plays a vital role in our approach to complex 
integration, which we call the Paving Lemma. This result is so important that we present 
two different proofs of it. To set it up, we first need another piece of terminology. 

A path y : [a,b] > C is said to be a path in S if the image 


{y(t) :t € [a,b]} CS 
We signify this by writing y : [a,b] > S. 
Example 2.31. The unit circle C(t) = cost+isint(t € [0,2z7]) is a path in the region 


S={zeC: a < |z| < 2}, Figure 2.16. 
A region of this form, contained between two concentric circles, is called an annulus. 


This example is very simple. It must not lull us into a false sense of security: far more 
intricate paths are possible. 
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Figure 2.16 Path in an annulus. 
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Figure 2.17 A more complicated path. 


Example 2.32. Let V = {z € C: z 4 1/n for a non-zero integer n} and define 


_ f ttitcos(a/t) (¢ € [-1, 1] \ {0} 
om =| (t = 0) 


Then o is a path in V, Figure 2.17. Here o winds in between the points 1/n where n is 
a non-zero integer, crossing the real axis when t = 1/(n + 5): 


It does not take much imagination to realise that an open set S can be very compli- 
cated, and a path y can be intertwined in S in a very intricate manner. The space-filling 
curve described in Section 2.9 is given by a path whose image is the unit square, includ- 
ing its interior. Much more complicated images are possible. Rather than trying to 
imagine all possible intricacies, we sidestep the issue, as follows. If S is an open set, 
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Figure 2.18 Paving a path with a sequence of discs. But could this process get stuck? 


we can cover y with a finite number of open discs, all contained in S. Then we use the 
discs to simplify the path, as the next result explains. 


LEMMA 2.33 (Paving Lemma). Let y : [a,b] — S be a path in an open set S. Then 
there exists a subdivision a = tg < ty < ++: < t, = b such that each subpath y, 
obtained by restricting y to [t,—, t,] lies inside an open disc D, C S. 


Proof. Because S is open, there is an open disc D; such that y(a) € Di CS. 

The idea of the proof is that if the path has been paved up to some point, we can 
always add another disc to take the paving further, unless we have reached the far end. 
Making this argument rigorous takes a little care, however, as we now indicate. 

The disc Dy is the first in a sequence of discs, obtained by moving along the path, 
covering it bit by bit with discs as in Figure 2.18. The existence of D; shows we can get 
started. Indeed, since y is continuous, there exists 6 > O such that 


y(t) € Dy whenevera <t<a+6 (2.8) 


Our only fear is that we might not progress all the way; for instance, the discs might 
decrease dramatically in size and we might never reach the end. Some real analysis 
dispels this fear. If a < x < b, say that y restricted to [a, x] ‘can be paved’ if it can 
be subdivided into a finite number of subpaths y1,...,7%n, where each 7, lies inside an 
open disc D; C S. Let P be the set of all x € [a,b] such that [a, x] can be paved. We 
know that a € P so P is non-empty. 

Also P is bounded above by b. Therefore, by real analysis, P has a least upper bound 
c, where a < c < b. We claim that c = b, which will prove the lemma. 

For a contradiction, suppose that c < b. Since S is open, there exists an open disc D 
such that c € D C S. By continuity of y, there exists ¢ > 0 such thatifc—e <t<c+e 
and ¢ € [a, b], the image y(t) € D. 

By (2.8), a+ 6/2 € P,soc > a. Since c is the least upper bound of P, there exists 
x € Pwithc—e <x < c. Therefore the segment {y(f) : a < t < x} can be paved by open 
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discs D,,...,D,, all lying inside S. Let D,,,; = D. Then the segment {y(): a < t <c} 
can be paved by open discs Dj,..., Dm, Dm+1 all lying inside S. 

Finally, we claim that c = b. If c < b, continuity of y implies that y(c+ 4 € D for 
all t such that 0 < t < Tt, therefore [a,c + t/2] can be paved, contradicting c being an 
upper bound for P. Therefore c = b so [a, b] can be paved as required. 


For future reference, and because this result is used repeatedly, we now give a second 
proof, which in some respects is simpler. A generalisation to two dimensions will also 
prove useful in Proposition 9.5. 


Alternative Proof of Lemma 2.33. We show more: the subdivision can be obtained by 
repeatedly bisecting [a, b] a finite number of times. For the purposes of this proof only, 
it is useful to define and interval to be pavable if it can be bisected repeatedly a finite 
number of times into closed intervals, so that the image of every interval obtained in this 
manner lies inside a disc in D. Otherwise, the interval is unpavable. 

Suppose the result is false. Then [a, b] is unpavable. Bisect [a,b] to get two closed 
intervals, intersecting at the midpoint. At least one these intervals, call it 7; , is unpavable. 
Otherwise we could pave a finitely repeated bisection of each interval by finitely many 
discs, whose union paves [a, b]. 

Bisect J;. The same argument applies to the two halves, so we get an unpavable 
closed interval Ib C J; of half the size. Because we are assuming the result false, this 
sequence of bisections continues indefinitely to give a nested sequence of unpavable 
closed intervals 


[a,bJ 2h Dh2--- 


each half the size of the previous one. 

The intersection of these intervals is easily seen to be a unique point z € D (Cauchy 
sequences of real numbers converge). However, D is open, so there is a disc Ne (y(zo)) © 
D. Since y is continuous, y (Ne (y(zo))) is open in [a,b] and contains zo. Therefore it 
contains /, for sufficiently large n. Now I,, C N,(y(Zo)), So it is pavable (with one disc), 


a contradiction. 


In Example 2.31, for any point on the unit circle C, we have Nj /2(z) © S. Clearly a 
finite number of such discs will pave C within S, Figure 2.19. 

However, the path in Example 2.32 cannot be paved by a finite number of discs in V. 
Any disc containing the origin (which lies on o includes points of the form 1/n lying 
outside V. But this also means that V is not open. So the conditions required for the 
Paving Lemma are not satisfied. 

Readers familiar with basic point-set topology will recognise that the proof is ‘really’ 
about the compactness of the closed interval [a, b]. We prefer not to develop the abstract 
machinery of compactness here; the Paving Lemma performs the same task. 


Connectedness 


We now define an important topological property, which is central to the theory of 
complex functions. 
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Figure 2.19 Paving a circular path inside an annulus is easy. 


Figure 2.20 A disc is path-connected. 


DEFINITION 2.34. A subset S C C is path-connected if, given z,,z. € S, there exists 
a path y in S from zj to Z9. 


Example 2.35. Any open disc N,(zo) is path-connected. For, given z1,z2 € N;(zo), let 
L(t) = (1 — pz + tz, (t € [0, 1]). Then 
|L(t) — z)| = |(1 — t)z1 + tz2 — zol 
= | —d2 — (1 — Dep + to — tal 
< (1 —d)lz1 — Zol + Aza — Zl 
<(l-f)r+t¢4r=r 


so the line segment L lies in N,(z). See Figure 2.20. 


Example 2.36. The set S = {z € C: z=t+if,t € R} is path-connected. For if t) + iff 
and f2 + it; € S where t) < f2, then y(t) =t+ i (t € [t1,f2]) is a path in S between 
those points. See Figure 2.21. 
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Figure 2.21 The set{zeC:z=t+ it?,t € R} is path-connected. 


Figure 2.22 A step path. 


It is often convenient to use a very simple type of path. 


DEFINITION 2.37. A step pathin Sis a path y : [a,b] — S together with a subdivision 
a= < t, < +--+: < t = b such that on each subinterval [t,_1,¢,] either re y or imy 
is constant. This means that the image of y consists of a finite number of straight line 
segments, each parallel to the real or imaginary axis, see Figure 2.22. 

In other words, y is a step path if y = y, +---+ y, where each y, is a line segment 
parallel to the real or imaginary axis. 


A subset S C C is step-connected if, given z1,z2 € S, there exists a step path y in S 
from z1 to z2. 


Evidently a step-connected path is path-connected. The converse, however, is not 
always true: Example 2.36 is path-connected but not step-connected. A path as simple as 
the diagonal line segment [0, 1 + i] is path-connected but not step-connected. The main 
issue here is that unlike a general continuous path, a step path consists of finitely many 
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straight line segments. But the requirement that these segments should be horizontal or 
vertical also comes into play. 

These considerations are mentioned here to help develop intuition, but they do not 
cause problems for the theory, because every path-connected open set is step-connected. 
This constitutes the first success — albeit a minor one — for the Paving Lemma. To prove 
this, we first observe a simple fact: 


LEMMA 2.38. Any open disc is step-connected. 


Proof. Consider an open disc S = N,(zo). Changing coordinates to z — zo (translating 
the origin) we may assume without loss of generality that z) = 0. Let z} = x; + iy, € S. 
We show that there is a step path in S from z, to the centre 0. Let L; be the (vertical) 
line segment from z; to x;, and L2 be the (horizontal) line segment from x; to 0. Then 
simple inequalities show that L; and L2 are inside S, and L; + L2 is a step path from z 
to 0. 

By the same argument, if z. € S there is a step path L3; + L, in S from z to 0. But 


now L; + L2 — L4 — Ly is astep path in S from z, to Z2. 


We can now prove: 
PROPOSITION 2.39. A path-connected open set is step-connected. 


Proof. Let z1,z2 in S, which is open, and let y be a path from z; to z2 in S. 

By the Paving Lemma there is a finite sequence of open discs Dj,...,D, C S anda 
path y,; in each D, such that y = y, +--- +}. 

By Lemma 2.38 there is a step path o, in each D; with the same initial and final points 


as 7. Nowo = 0) +---+ 0; is a step path in S from z, to z2. 


On the other hand, the parabola S in Example 2.36 is path-connected, but not step- 
connected. This does not conflict with the proposition above, because S is not open 
in C. 

Arbitrary sets need not be path-connected. However, for any set S we can define the 
relation z1 ~ z2 (z1, z2 € S) to mean that there is a path in S from z) to z2. It is easy to 
see that ~ is an equivalence relation (Exercise 7 below) and that each equivalence class 
is path-connected. These equivalence classes are called the path-connected components, 
or for simplicity the connected components or just components of S. (We always take S 
open in the sequel, so here is no conflict with the more general notions of connectedness 
and connected components that are standard in point-set topology, but irrelevant here.) 

For instance, the connected components of 


A={zeEC:|z| Al} 
are clearly 
Ay ={zeEC: [z| < 1} Ar ={zeEC: |Z) > }} 


If S is open then its connected components are all open: given zo in a connected com- 
ponent Sp of S, since some N,(zo) € S and N,(zo) is path-connected, we must have 
N,(Zo) © So, so So is open. 
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A particularly interesting case is the complement of the image of a path y : [a, b] > 
C, or more briefly the complement of a path, by which we mean 


{ze C:z A y(t) for any te [a,b]} 
The complement may be connected, for instance when 
y@j=t (te [0,1)) 


On the other hand, it may have two or more components, Figure 2.23. 
Indeed, the complement of y = 71 — y, where 


yi0)=0,yu(@) =t+itsin(z7/) (0 <t<1) 
y2(0) = 0, w2()) =t—itsin(7/) (0 <t<1) 
has infinitely many components, Figure 2.24. 


DEFINITION 2.40. A subset S C C is bounded if there exists K > 0 such that |z| < K 
for allze S. 
A subset S € C is unbounded if it is not bounded. 


Figure 2.23 Paths whose complements have one, two, or three components. 


Figure 2.24 A path whose complement has infinitely many components. 
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However complicated a path may be, we can prove: 


PROPOSITION2.41. (i) The image of a path in C is closed and bounded. 
(ii) The complement of a path in C is open, and precisely one component is unbounded. 


Proof. For the second part of (i), let the path be y : [a,b] — C, and let m() = |y(O|. 
Since it is the composition of two continuous functions, m : [a,b] — R is continuous. 
By real analysis it is bounded, say m(t) < K, whence |y(t)| < K for all t € [a, b] and 
the image of [a, b] lies inside the disc centre 0 of radius K. 

The first part of (1) follows from (ii). 

To prove (ii), suppose that zo lies in the complement T of y. Then 


L(t) = |y@ — zo| 
is a positive real number for all t € [a,b]. Further, 2 is continuous, so it is bounded and 
attains its bounds. That is, there exists k > 0 such that w(t) > k for all t € [a, b], and 
there exists fo € [a,b] such that w(to) = k. 


If k = 0 then zo € y({a, b]), but itis in T. Sok > 0. Then Nk(zo0) € T, so T is open. 
Finally, the image of y lies inside 


A={zeEC: |z| < K} 
and the complement 
B={zeEC: [z| > K} 


is clearly connected. Therefore any unbounded component of T intersects B, and since 
components are disjoint, at most one of them can do so. Further, B lies inside some 
component, so there is exactly one unbounded component. 


REMARK 2.42. A closed, bounded set in R”, in particular one in C = R?, is said 
to be compact. Compactness is a very important property in point-set topology (where 
the definition is given in a more general form). In this text we do not need to set up 
the full machinery of compactness, because the simple cases that arise can be tackled 


bare-handedly using real analysis — as we did in proving Proposition 2.41. 


We now define a special type of set that is fundamental to the theory in this text. 


DEFINITION 2.43. A domain is a non-empty, path-connected, open subset of the 
complex plane. 


By Proposition 2.39, a domain is also step-connected, and this property will be useful 
later. This means that we can refer to a domain as being ‘connected’, meaning either 
path- or step-connected as appropriate. 

As the theory of complex analysis unfolds, the reader will see the immense impor- 
tance of this definition. We restrict the concept of a complex function f to those of the 
form f : D + C where D is a domain. Openness of D lets us deal neatly with limits, 
continuity, and differentiability, because zo € D implies that N;-(zo) C D for some ¢ > 0, 
so f(z) is defined for all z near z9. Connectedness of D guarantees there is a (step-) path 
between any two points in D, which lets us define the integral of f along such a path. 
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However, restricting complex functions to those defined on domains has far subtler 
consequences than merely providing a platform for the appropriate definition. Those 
who have studied the intricacies of real analysis in depth will find untold riches in com- 
plex analysis, quite unlike the real case. For instance, if two differentiable complex 
functions are defined on the same domain D and they happen to be equal on a small 
disc in D, they are equal throughout D. No such result holds for general differentiable 
functions in the real case. This result is just one of many that illustrate the beauty and 
simplicity of complex analysis. We shall not prove it until Chapter 10, but we mention 
it here to underline the fundamental importance of establishing appropriate topological 
foundations for the subject. 


Space-filling Curves 


A space-filling curve is defined by a path whose image fills a two-dimensional region, 
typically the unit square. Such curves show that our default mental image of a continuous 
path can be distinctly misleading. To explain why care is needed, we digress to describe 
one simple construction of such a path. 

Let U? be the unit square {x +iy:0<x<1,0<y<l]}. 


DEFINITION 2.44. A space-filling curve is a continuous path y : [0,1] — U? such 
that the image of y is the whole of U?. 


Giuseppe Peano constructed the first space-filling curve in 1890, motivated by Georg 
Cantor’s proof of 1878 that [0,1] and U? have the same transfinite cardinal. In 1879 
Eugen Netto had proved that there is no continuous bijection between these sets; Peano 
showed that such a map exists if we do not require it to be bijective. In 1891 Hilbert 
published another example of a space-filling curve, and included a picture. Both con- 
structions are fairly complicated: Figure 2.25 shows early stages in their construction. 
Many others have since been found. Details can easily be found on the Internet, and are 
given in Sagan [18] and Bader [2]. 

We give a simpler construction, which at each stage produces a curve whose image is 
a square grid. 

Peano’s result was unexpected because it challenges the naive notion of ‘dimension’. 
A curve has dimension 1, but U? has dimension 2. It was counterintuitive that a contin- 
uous map can increase the dimension of a space. Exercises 15—17 show that there is no 
bound on the size of this increase. This phenomenon led to more careful theories of the 
concept of dimension for topological spaces, and the realisation that stronger conditions 
than continuity are needed for some intuitively plausible properties to hold. 

The idea behind the construction of such curves is purely topological. It relies on 
simple properties of uniform convergence, as studied in real analysis courses. It requires 
the definition and properties of the closure of a subset S C C, stated in Definition 2.5. 


PROPOSITION 2.45. Let ym : [0,1] — U? be a sequence of continuous paths, for 
n €N. Suppose that: 
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te 


Figure 2.25 Left: An early stage in the construction of Peano’s space-filling curve. Right: An early 
stage in the construction of Hilbert’s space-filling curve. 


(i) The sequence of functions (yn) is uniformly convergent. 
(ii) The closure of the union of the images of all yp is the whole of U?. 


Then the limit 


y = lim yp, 


n> oo 
is continuous, and its image is U?. 


Before giving the proof, it must be made clear that we are not just asserting that y 
comes close to every point of U?. It actually passes through every point of U? in the 
sense that if z € U? there exists t € [0, 1] such that y(t) = z. In contrast, the union of the 
images of the yp is not the whole of U7. It is just dense in U?: that is, its closure is U*. 


Proof. By condition (i), a basic theorem in real analysis implies that the limit y exists 
and is continuous. 

Let z € U?. By condition (ii), for each m > 0,m €N, there exists tm € [0,1] and 
Mm € N such that 


1 
lYnm (tm) — Z| << — 
m 


The subsequence yp,, also tends uniformly to y. The sequence (fm) lies in [0, 1] which 
is closed and bounded, so it has a convergent subsequence, with limit fo. We claim that 
y (to) = z. By uniform continuity, if m is large enough, 


1 
lYnm (to) — Y (to)| < — 
m 
So 


5 
ly(to) —2| < a 


for all m, so y(to) = z as claimed. 
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We now construct such a sequence of maps y, : [0,1] —> U?. We prescribe it 
geometrically, but making the construction precise is routine. 

Define yo so that its image is the boundary of U? described anticlockwise, parame- 
terised proportionally to arc-length; see top left of Figure 2.26. 

To obtain y;, replace each segment of y) by a T-shaped polygon, as shown at the 
bottom of Figure 2.26. The extra part of the T always extends to the left of the segment, 
as oriented by the arrow. The path y; goes along the original path to its midpoint, turns 
left, turns back on itself to return to the midpoint, and then continues along the original 
segment to its end point. If we do this for each of the four segments of yp and add the 
T-shaped paths (whose ends and starts fit together correctly) we obtain the path y, at 
top right of Figure 2.26. For clarity, the grey polygon shows the direction in which t 
traverses y1. 

This path visits the centre of the square four times, but all segments are treated in 
the same manner, making it easier to define the y, inductively. We scale the arc-length 
parametrisation so that each y, has the same parametric interval [0, 1]. 

We now repeat the same construction on every straight segment of , to obtain j2, 
and continue inductively to define y, for alln € N. 

The image of y, is a square grid of lines separated by distance 2~”. Clearly the union 
of these images is dense in U?: it consists of all z= x + iy for which either x or y is of 
the form m2~” for 0 <m < 2”, where m € N. So condition (i) holds, and we can define 
y as the limit of the yp. 

By considering the construction step at the bottom of Figure 2.26, it is easy to prove 
that 


lYn(t) — Ynti(O| < wees on 


from which condition (ii) follows (by summing a geometric series). 

Therefore y is a space-filling curve by Proposition 2.45. 

Although the image of y, is a grid, the actual structure of y becomes increasingly 
complicated because of the way it zigzags back and forth. If we write 


~« T - 
A A 
v v 
y 
A +> +> 
A A 
y vy 
———_»—___— —_>—-__»— 
A 
v 
e ~ - i 


Figure 2.26 Inductive step used to construct a space-filling curve. 


2.10 


2.10 Exercises 55 


Figure 2.27 Graphs of real and imaginary parts of yo, y,. Grey lines included for comparison 
purposes. 


Ynlt) = X(t) + iyn(t) 


SO Xn, Yn : [0,1] — [0, 1], then Figure 2.27 shows the graphs of xo, yo and x), y1. 

The slopes of the parts of these graphs that are not horizontal become increasingly 
steep as n increases. Nevertheless, the limit is continuous — because of uniform conver- 
gence, and more intuitively, because the intervals on which such slopes occur become 
arbitrarily small (though there are increasingly many such intervals). 

Exercise 15 shows that the unit hypercube in R” is also the continuous image of an 
interval in R. In fact, far more complicated sets can be the images of space-filling curves. 
The last word in such results is the Hahn—Mazurkiewicz Theorem: proved by Hans Hahn 
and Stefan Mazurkiewicz. This states that a set S is the image of a continuous map from 
[0, 1] to a Hausdorff topological space if and only if S is compact, connected, locally 
connected, and metric. See Hocking and Young [9], page 129. 


Exercises 


1. Let z € C anda, b € R be arbitrary. Draw the set consisting of all z € C satisfying 
the following conditions. In each case, state whether the set is open, closed, or 
neither. 

(i) 1 < |z| < 2 

(ii) rez > Oandimz < rez 

(iii) rez > Oand 1 < |z| <2 
(iv) re(zzo) > O 
(v) a < arg(z— Zp) < bwhere -t7 <a<b<a7 
(vi) |z— a| = |Z — Zol 

(vii) |z— 4| < [z— zol 

(viii) |z— | < |Z —zol 

(ix) re(z’) > 0 
(x) re(z) < 0 

(xi) re(?) > 1 

(xii) re(?) < 1 

2. Prove Proposition 2.8 parts (i) and (ii). 
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3. Find the following limits, if they exist. If they do not exist, explain why. 


(i) lim,—+o |z|/z 

(ii) lim, |z|?/z 

(iii) lim, 9 2/Iz? 

(iv) lim, o(z — rez)/im z 

Prove from first principles that the following functions are continuous: 
(i) rez 

(ii) imz 

(iii) z+ |z| 

(iv) I/z @ #0) 


(v) |zI? 


5. Prove Proposition 2.14. 


6. For each of the following sets and pairs of points, define (if possible) (a) a path in 


the set between the two points, and (b) a step path in the set between the two points. 
(i) |z| < 2;1+1,1-1 

(i) |z| = 1;-1,1 

(ii) 1 < |z| < 2; V2, -V2 

(iv) |re z| > 5; —9 + 37i, 1066 + ia + 5/17) 

(v) [1 —lzll > 35 1/3,490 +i) 


7. Let y; = [i,0] and yz = [0, 1]. Show that y, + 7 is defined, but y2 + yj is not. 


8. Let y, = [0, 1] and y2 = [1,0]. Show that y; + y2 and y2 + 1 are both defined, but 


10. 


11. 


12. 


they are different paths. 


Let S be a subset of C. If z, w € S, define z ~ w if and only if there is a path from z to 
w. Show that ~ is an equivalence relation. The equivalence classes are components 
of S. If S is open and non-empty, show that each component is a domain. 


Let S be a path-connected subset of C, and let f : S — C be a continuous function. 
Prove that f(S) is path-connected (even though it may not be open). 


Give explicit functions for paths that describe the curves of Figure 2.28 in the direc- 
tion indicated by the arrows. (All subpaths are parts of circles or line segments; 
0 <e < Rand x1,X, y, y2 are positive reals.) 


Let S be a subset of C. A point z € Cis a boundary point of S if zis a limit point of 
S and also a limit point of the complement C \ S. The boundary aS of S is the set of 
all boundary points of S. In the following cases, describe 0S and state whether 0S is 
path-connected. Draw a picture in each case. 

@) S={zEC:1 < |z| < 2} 

(ii) S={zeC:z40} 
(ii) S = {z € C: z=x-+ iy where x,y € Q} 
(iv) S={zeC:0<rez<10<z< 

(v) S = the intersection of the sets S in (iii) and (iv) 
(vi) S={zeC:zHiywherey € R,y < 0} 
(vil) S = the intersection of the sets S in (vi) and (ii) 
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AOA 


Figure 2.28 Six paths in C: write down formulas for them. 


Figure 2.29 Why can paths like this not be used to construct a space-filling curve? 
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13. In the following cases the boundary of S (see Exercise 12) can be described as the 


image of a path. Draw a picture of S and specify a function y giving such a path. 


Gi) S={zeC: |z| < l,imz > 0} 

(ii) S= {ze C:1<|z| <2,imz> 0} 

Gil) S={zeC:0<rez< 1,0 <imz< 1} 
(iv) S={zeC:1 < |z| < 2,0 < imz < rez} 


14. In the construction of a space-filling curve in Section 2.9, why can we not just take 


Yn to be a path that zigzags n times across U?, as in Figure 2.29? Justify your answer. 


15. Let U? be the unit cube in R*. Show that there exists a continuous map from [0, 1] 


onto U?. (Hint: the hard way is to construct paths with similar properties to those in 
Section 2.9. The easy way is to observe that U? =U? x [0,1]. Map [0, 1] x [0, 1] 
onto U* x [0, 1]; then compose with y : [0,1] — [0,1] x [0, 1].) 
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16. 


17. 


18. 


19. 


Let U” be the unit hypercube in R”. Show that there exists a continuous map from 
[0, 1] onto U". 


Let m,n > 0 be integers. Prove that there exists a continuous map from U” onto 
U", even when m < n. 


If yn is a sequence of step paths in U, can the union of their tracks equal U (rather 
than just being dense in U”)? Prove your answer correct. 


Does there exist a continuous map y : [0,1] —> U? that is one-one as well as onto? 
Justify your answer. 


3.1 


Power Series 


Many of the more important functions studied in real analysis, such as the exponen- 
tial and trigonometric functions, are most conveniently defined using power series. The 
familiar power series for the sine and cosine go back at least to around 1400 with the 
work of Madhava of Sangamagra, whose discovery of these series was reported in the 
1501 Yuktibhasa of Jyesthadeva. This book also included Gregory’s series 


for the inverse tangent function: 


The first term is the product of the given sine and radius of the desired arc divided by the cosine 
of the arc. The succeeding terms are obtained by a process of iteration when the first term is 
repeatedly multiplied by the square of the sine and divided by the square of the cosine. All the 
terms are then divided by the odd numbers 1, 3,5,.... The arc is obtained by adding and 
subtracting respectively the terms of odd rank and those of even rank. 


Both Madhava and Jyesthadeva were members of the Kerala school of Indian mathe- 
matics. The series were rediscovered in the West in 1676, when Newton stated them in 
a letter to Henry Oldenburg, Secretary of the Royal Society. They were derived again 
by Abraham de Moivre in 1698, by James Bernoulli in 1702, and were widely used by 
Euler in the 1730s. 

Power series are, if anything, even more important when we move from real to com- 
plex functions. In the 1820s Cauchy made considerable use of power series )> ayz" of a 
complex variable z. In particular, any real function having a power series development 
automatically gives rise to a complex function with the corresponding power series. This 
provides a natural method for extending real functions to the complex case. In the 1840s 
Weierstrass showed how to base the theory of complex functions on power series. 

In this chapter we develop some elementary properties of sequences and series of 
complex numbers, mostly by direct analogy with the real case, and then specialise to a 
deeper study of power series. 


Sequences 


For our purposes, sequences are required only as a stepping-stone towards series. A 
(complex) sequence is a function 


f: NOC 
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where as usual N denotes the natural numbers {0, 1,2,...}. It is convenient to write 


Zn =f (n) 
and to arrange the numbers Z,, called the terms of the sequence, in order as 
£Os 215 225 0-04 Zp os 
Alternative notation 
(Zn) (n= 0) 
or just 
(Zn) 


are often used for brevity, and it is sometimes helpful to start the sequence at | instead 
of 0, like this: 


Z15 225235255 faye 
Now the corresponding f maps N \ {0} to C. 


DEFINITION 3.1. A sequence (z,,) tends to the limit z as n tends to oo if, given any real 
€ > O, there exists a natural number N(e) such that 


n > N(e) implies |z, — z| < € 


A sequence that tends to a limit is convergent. 


This definition is identical to the usual one for real numbers, except that z, z, may be 
complex, and the absolute value is as defined for C. 
We write 


lim z, =z 
n—-> Oo 


or 


Zp > ZasSn—> Co 


The geometric content of this definition is that for sufficiently large n, all terms z, lie 
inside arbitrarily small discs centred on z, as in Figure 3.1. This again is reminiscent of 
the real case. 


Figure 3.1 A complex sequence converging to a limit. 
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The problem of finding limits of sequences of complex numbers can be reduced 
directly to the real case: 


LEMMA 3.2. Let (z,) be a sequence of complex numbers, with Zz = Xn +ivn, An.Yn € 
R). Let z= x +1y (% y, € R). Then 


lim zn =z 
no 
if and only if 
lim x,=x and lim y,=y 
no no 


Proof. Suppose that limy+oo Z, = Zz, and let € > O. There exists N € N such that 
lf — z| < e ifn > N. For any w = u +i we have 


wis Ve+P=|wl Whe vw+v? = [Wl 
Therefore, taking w = z, — z, for all n > N we have 
Xn —X| < |@n —2z| <€ 
yn — Yl S la — 21 <é 


SOX, > x andy, —> y. 
Conversely, suppose that x, — x and y, — y. Givene > 0 there exist M,N € N such 
that 


Xm —xX| < €/2 for allm > M 


lyn — y| < €/2 foralln > N 
Let R = max(M, N). If m > R then 


en — Z| S Wn — x1 + lyn —y| S €/2+€/2=€ 


SO Z — Z. 


Since (xn) and (yn) are real sequences, their limits x and y are also real and can be 
found using the techniques of real analysis. 


Example 3.3. Let z, = (n in? i, Does zn, converge? If so, to what? 
Separate z, into real and imaginary parts, thus: 


Z, = (nt+in* +1)7! 


= (n+ Din+1)* +n! —in* (M41)? +7°] 


using (1.8) of Chapter 1. Then 


n+1 

ame 1p Z > Oasn— co 
_— nr 

m= G Pe 7 > Oasn— oo 


SO 


Zp > 0+1-0=0 
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A similar idea gives the complex version of the General Principle of Convergence: 


THEOREM 3.4 (General Principle of Convergence). A sequence (Z,) tends to a limit z 
asn—> oo if and only if for alle > 0 there exists N(e) € N such that 


m,n > N implies |Z, — Z,| < € (3.1) 


Proof. First, let z, — z. Then there exists N = N(e) such that |z, — z| < ¢/2 for all 
n> WN. If m,n > N then 


m= <al = nmi + k—- el a ef2+ef2=e 


so (3.1) holds. 
Conversely, assume (3.1) holds. Let x, = re(Z,), Yn = 1m(z,). If m,n > N then 
Xin — x, = Zin ~~ Znl <€ 
lym — Ynl S lZm — Zn| < € 
By the General Principle of Convergence for real sequences (that is, the same statement 


for a real sequence using the real absolute value), it follows that x, — x and y, — y for 
some x,y € R. By Lemma 3.2, z, > z=x+iy. 


Example 3.5. Use the General Principle of Convergence to show that the sequence 


defined by 
= 3—4i\" 


converges. 
Compute 


_ 3-—4i1\” 3—4i\" 
2m — Zn| = 6 6 


where r = min(m,n). Since 7 < 1 we can make this less than any given ¢ > 0 by taking 
r large enough. 

Note that trying to tackle this question using Lemma 3.2 directly does not work 
out very easily, although the intrepid reader who converts (3 — 41)/6 into polar coor- 
dinates stands a better chance). It may of course be shown from the definition that 


Zn > in/2. 
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Given a sequence (Z,), we can form another sequence of partial sums defined by 


n 
=e =stiate +z 
r=0 


If s, tends to a limit s € C, we define 
CO 
ye =s= lim sy, 
n—->OCo 
r=0 


and say that the series 


converges to Ss. 
A series that converges to some s is said to be convergent. We often also write 


S=Mr™Wy+rwZyr-:: 


when convenient, but note that this is a definition of the + --- notation, since infinite 
additions do not have any meaning until we give them one. 

A series that is not convergent is called divergent. 

By way of this definition, any question about the convergence of a series can be 
turned into one about the corresponding sequence (s,) of partial sums. For example, 
Theorem 3.4 applied to (s,,) yields: 


LEMMA 3.6. A series )--°. z; converges if and only if, for all ¢ > 0, there exists 
N(e) € N such that 


n 
m,n > N(e) implies » Z| <€ 
r=m+1 


Proof. Observe that eer Zp = Sy — Sm. Now use Theorem 3.4. 


We can replace m+ 1 by m here, by working with N + 1 if necessary. This is usually 
more convenient. 


COROLLARY 3.7. lip pat Z, converges then z, > Oasr > o. 


Example 3.8. Let 


CO 
Does > 9 2, converge? 
We have 
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This series is a (finite) geometric series, whose sum is known to be 


(-@" 0-8) 


which obviously may be made less than any required ¢ > 0 by taking m,n large enough. 


WARNING 3.9. There is an ancient, venerable, and politically incorrect joke about two 
country yokels who each buy horses and want to be sure which is which. So they cut the 
tail off one. Next day, someone has cut the tail off the other. So they try again with an 
ear... with the same result. Finally, in desperation, one says to the other “Tell you what: 
you take the black one and I’ll take the white one,’ 
For some reason, many students become confused between sequences and series. 
There is no reason to fall into this trap, because: 
ee) 
series are the ones that begin with > 
r=0 


This does not eliminate all sources of confusion, but it’s a good start. 


As with sequences, it is sometimes preferable to start at 1 rather than 0. The series 
then takes the form 


lo) 
ye &r 
r=1 
or 


ZytateBte: 


whose precise definition is left to the reader. Variations, such as 


[o,2) 


se 


r=3 


are also feasible. However, this is no more than )>7°,y z- where we set zo = z1 = 72 = 0. 
To simplify notation, we often abbreviate such sums to 


de 


when the limits of summation are clear. 
The summation of complex series may, if desired, be reduced to equivalent real series. 
Applying Lemma 3.2 to the partial sums, we obtain an immediate proof of: 


LEMMA 3.10. Let z- = x; + ivr where x,, yy € R. Then > z; converges if and only if 
both real series }- y, and > y, converge. If so, 


Dae = Sixt iy 
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Example 3.11. 11 Let z, = (i)”/n?. Does > ~z, converge? 
We have 
es 0 if n is odd 
"| (-1"/2/r2 ifn is even 
Ignoring the zero terms, > x, is an alternating series (its terms are alternately positive 


and negative) whose terms tend to zero, so it converges. 
Similarly, y, = im(z,) yields a convergent alternating series. Hence )~ z; converges. 


In the same manner, considering real and imaginary parts, or by mimicking the usual 
proof for real series, we get: 


LEMMA 3.12. Let }\ a; and )_ b; be convergent series, and let c be a complex number. 


Then 
Yar + b;) 2 oe ob, 
a Cay =C > dy 


When verifying that a complex series converges without computing its actual sum, it 


is often better to concentrate on the modulus rather than the real and imaginary parts. 
By analogy with the real case, we introduce: 


DEFINITION 3.13. A series )>z, is absolutely convergent if and only if the series 
>- |z,| is convergent. 


THEOREM 3.14. An absolutely convergent series is convergent. 


Proof. Let >> z, be absolutely convergent and let z, = x, + iy;y. Then |x,| < |z,| and 
ly-| < |z,|. But }°|z,| is convergent, so by the comparison test for real convergence, 
>= |xr| and 5° |y,| are convergent. Since absolute convergence implores convergence in 


the real case, }> x, and )° y; are convergent. The result follows by Lemma 3.10. 


Example 3.15. )°z, = > (i”/n7) converges because )*|z,| = > (1/n?) converges. 
(Compare this proof with the previous example.) 


Theorem 3.14 is useful; because it lets us verify convergence of many complex series 
> zr by reference to the real series > |z|. Since the latter has positive terms there is a 
complex version of: 


THEOREM 3.16 (Comparison Test). Let >> a; and >> b; be complex series, with )~ ay 
absolutely convergent. If 


lb;| < Kla,|_ (r > N) 


for some positive K and integer N, then )° b, is absolutely convergent, hence convergent. 
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There is also a complex version of: 


THEOREM 3.17 (Ratio Test). Let doar be a complex series with non-zero terms such 
that 
lar | 


roo |r — ]| 7 


If. <I, the series )° a, is absolutely convergent. If i > 1, the series is divergent. If 
A. = 1, it may be convergent or divergent. 


Proof. For X < 1, let p = S(a +1). Then A < p < 1 and there exists N such that 
laI/lail<p (>N) 
Hence 
lar] < plar—1| < p* lara] <--> <p" “lanl >) 


and )\ a, converges by comparison with > p’ for real p < 1. 
In the case A > 1, we see that for some N 


lar|/la-11> 1 © >N) 


By Corollary 3.7, )> a, cannot converge because the terms a, do not tend to zero. 
The final case 4 = 1 applies to }>1/r and > 1/r’. The first diverges, the second 
converges. So both possibilities may occur. 


Power Series 


We can now give a formal introduction to one of the central ideas in complex analysis: 


DEFINITION 3.18. Let zp € C. A series of the form 


Y= an(z — 20)" (3.2) 
n=0 


with coefficients a, € C is a power series about zo. 


By the change of variable z’ = z—zo we can usually reduce properties of power series 
to the case zo = 0. Here, convergence is governed by the following results: 


THEOREM 3.19. (i) If a power series )\anz" converges for z = 21 # O, then it 
converges absolutely for all z with |z| < |z,|. 
(ii) [fa power series ) ayz" diverges for z = z2, then it diverges for all z with |z| > |z2]. 


Proof. (i) If a power series )> ayz" converges then |a,z"| > 0 as n — ov, by Corol- 
lary 3.7. Thus there exists K € R such that |a,z"| < K for all n. If |z| < |z1| then 
q = |z/z1| < 1. Now 


lanz"| = |anzi\|z/z1|" < Kq” 


so by the comparison test, )* |anz”| converges. 
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(ii) If |z| > |z2| and }°a,z” converges then by (i), })a,z5 also converges, a contradic- 
tion. So }° anz" diverges. 


These results lead to an important concept: 
DEFINITION 3.20. Let 
R = sup{\|z| : there exists z such that |a,z"| converges} 
(allowing R = oo if no real supremum exists) then it follows at once that 


ya yn | converges for |z| <R 
us diverges for|z| >R 


(We cannot yet say what happens for |z| = R.) We define R to be the radius of 
convergence of the series, and the set 


{zeC: |z| < R} 


is the disc of convergence, classically often called the circle of convergence because 
geometrically it is the interior of a circle centred at the origin. In extreme cases this may 
be just the origin when R = 0, or the whole of C when R = oo. 


In the general case (3.2) where zo is arbitrary, we apply the same definition with z 
replaced by 7 = z— zg. Now the conditions 


via (e—25" converges for |z —z| < R 
is . diverges for |z—zo| >R 


determine the radius of convergence R, and the disc of convergence is 


{zeC: |z—z| < R} 


Example 3.21. The series 1 + z+z?+--- is convergent for |z| < 1, since it is absolutely 
convergent in that case. It diverges for z = 1 and hence for all z with |z| > 1. Therefore 


its radius of convergence is 1. 


When |a;|/|ar—1| tends to a limit as r tends to infinity, the radius of convergence can 
be computed as follows. Let 


. a 
lim lar| l 
r>00 |a,_1| 
(which is positive or zero). Then 
r 
: Z 
lim lane | = 1 | 


By the ratio test, for / > 0 the series is convergent for /|z| < 1 and divergent for /|z| > 1. 
Hence the radius of convergence is 1 //. Put another way, the radius of convergence of 


> az" is 


< |ar—1| 
lim 
roo |a,| 


provided this limit exists. If / = 0 then the radius of convergence is 00. 
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Example 3.22. If a, = 1/n so }\ anz"” = )° z"/n, then 


|ay—1 | = 


im 
roo la;| 


1 


so the radius of convergence is 1. 


In general, the ratio |a;|/|a;—1| may not tend to a limit. Sometimes it is possible 
to spot the radius of convergence by native wit, but when all else fails we can use a 
powerful technique that works in all cases: 


THEOREM 3.23. The radius of convergence of )~ dnz" is given by 
1/R = limsup |a,|!/” (3.3) 


Proof. Define R using (3.3). 

Suppose first that |z| < R. Then we can choose p such that |z| < p < R. By the 
definition of lim sup, we have 1/p > |a,|!/” for all n larger than some N. Hence |a,| < 
1/p", so 


lanz”| = |anp"| - |z/p\" < |z/pl” 


Now |z/p| < 1 so yew ele converges. By the comparison test, Sa Anz" 
converges, 80 )>°°.) dnz” converges. 
Next suppose that |z| > R. Choose p such that |z| > p > R. Then 1/p < |a,|!/” for 


all n larger than some N. By comparison with )> |z/p|", os ayz" diverges. 


Example 3.24. The series 
Zz 
yop" 
n 
has radius of convergence R, where 
1/R = limsup(1/n)!/”" = 1 


so R = 1. The series converges for |z| < 1 and diverges for |z| > 1. When |z| = 1 
further analysis is required, for which we do not yet have the technique. 
The series }> a has radius of convergence R, where 


1/R = limsup(1/n!)'/” =0 


Therefore R = oo and the series is absolutely convergent for all z € C. 
The series 
2n 2n+1 
! 


Ae 4. 
Ly =) al 


similarly have R = oo, so they are absolutely convergent for all z € C. 
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In the next chapter we use the last three power series above to define the complex 
exponential, cosine, and sine functions. In order to derive some of their basic properties, 
we will use a theorem on the multiplication of power series. Roughly, this says that if 
we multiply them ‘term by term’ and collect terms in the right way, we get a series 
converging to the product. More precisely: 


THEOREM 3.25. Suppose that )~ dy and )° by are absolutely convergent, with sums 
a, b respectively. Let 


Cr = agby + ayb;-1 + +++ + aybo (3.4) 
Then )~> Cy is convergent, and its sum is ab. 


Proof. This is a direct extension of the corresponding result in real analysis. For readers 
unfamiliar with this theorem, we give details in Section 3.5. 


Manipulating Power Series 


The results derived so far let us calculate with power series ‘as if they are infinite poly- 
nomials’, provided they are absolutely convergent. To see this, let }° anz” and > bnz” 
be power series with radii of convergence Ry, Rp respectively. Let |z| < min(Ra, Rp). By 


Lemma 3.12, 
> anz” + s baz” = YoGn + by)z” 
By Theorem 3.25, 


62 an2") o> bn”) = Ge 


where c; is defined by (3.4). If we replace the infinite sum by a finite one, )~1, these 
formulas become the usual ones for addition and multiplication of polynomials. 

It is this feature that makes power series so useful: we can calculate with them 
relatively easily. In fact, we can use familiar algebraic methods. 

We take advantage of this to exhibit some important features of the complex 
exponential and trigonometric functions. 


DEFINITION 3.26. The complex exponential and trigonometric functions are defined 
by three power series: 


expz = — 
n! 
2n 


cosz = ey om 


; P Zntl 
sinz = Y(-D nad! 


These series are motivated by the corresponding real series. They are absolutely 
convergent for all z €¢ C by Example 3.24. 
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We begin with the expression cos @ + 1sin@ of Section 1.7. Adding the power series, 


we obtain 
cos@ +isin@d = > CrO" 
where 
(-1y? 
Cr = if r is even 
r! 
(—1)0-D/2 
Pe a ee, 
r! 
Now 2? = —1,i° = —i,i* = 1, so 
iv = (-1)"? if r is even 
iv = i(-1L0-Y/2 if r is odd 
Hence 


or 


—_ ro ; 
y c,O" = y ee 
We thus have the important formula 


cos 6 + isin@ = expid 


Therefore the polar coordinate form z = r(cos 8 +1isin@) for a complex number z can 
be written as z = r- expiO, or more briefly as re!?. (We define powers z“ of complex 
numbers in Section 7.3, and this will justify the notation expid = e ) 

Next, we apply the formula for the product of two power series to evaluate 


expz -expw 


where z,w € C. We obtain 


expz: expw 


] 
oN 
M 
eaiee 
——"" 
NN 
Os 
=| =, 
“——" 


n=0 \r=0 
Ce n 
- ys 7 ( c ) ww) 
! r 
n=0 r=0 
By the Binomial Theorem this is 
— 1 
» te tw)" = exp + w) 
n=0 — 


This proves 


expz: expw = exp(z + w) 
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Products of Series 


We sketch a proof of Theorem 3.25 above: if }* a, and )~b, are absolutely convergent, 
with sums a, b respectively, then ab = )° cn, where 


Cy = dob, + ayb,_1 + +++ + a,bo 


The proof may be easier to follow using Figure 3.2, which represents all possi- 
ble cross-products ash; on a square grid. The idea is that partial sum of )> cy is the 
sum of terms in a triangular region like the one shown in light shading. In contrast, 
products of partial sums of }°a,z" and )° b,z" correspond to terms in a rectangle, 
such as the one shown in dark shading. Our main task is to estimate the sum of 
terms in suitable triangular regions by approximation them by rectangles. Here are the 
details. 

Let }° |a,| = A, >> |b,| = B. Given e€ > 0 we can choose N large enough so that all 
of the following conditions hold. (To do this, choose an N for each condition and take 
the maximum of the three Ns chosen.) 


N N 
. E 
(i) (xs) (:5.) ab| < Peay Ay 
n=0 n=0 
au € 
(ii) > |a,| < ALBal 
n=N+1 
eal E 
ee b 
ay” Vl Saat 
n=N+1 


4 

— 
*e 
*e 
. a 
oO o 
ao . 
. . 
° . 
. . 
oO . 
oo a 
. . 
a . 
2 . 
O a 
oO a 


eeeee#e#e#se##ee#eee*e @# 
eeeee#ee##ee#e*#eee @ 
e*eeee#eesee#tee#e#e 
eeee#eee#e#e#e*#*#ee#eee@ 
eeeee#eeer°*°*e «© #® @ @ 
*eee#ee##e#s###*#ee#eee ee 
«ee ff #8 @ @ @ @ @ @ 
eeeee#ee###eee @ 
ee o . ca o ese — . a . 
oeeefe#eogee#e#eeestee @ 
ee . . oo . o o . °° o . 
ee e ion o a o o e o . a 
ee o °° o oO oO a o ° oO . 
oS) = . ° . 2 oO e . ° a oO 
ese e oa a 2 oO . o . a . 


<a 
7) 
= af 


Figure 3.2 Terms of >> a,z", >> bn z’, and their product. 
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Then 


Se, aa < Se, = (ds) (>) 


n=0 n=0 


|/N N | 
E-)e)-« 
n=0 n=0 
which from Figure 3.2 is less than or equal to 


2N N N 2N P 
5" lan (311) + (dota Ste 
n=N+1 n=0 n=0 n=N+1 

éB éA 


< 
A+B+1 A+B+1 


€ 
=€ 
A+B+1 


Therefore ) °c, converges to ab. 


Exercises 


1. Determine whether the following sequences converge, and find the limits of those 
that converge. 


@ (1+i") 

(ii) (1 + i)”/n) 

(ii) ((1 + i)"/n!) 

Gv) (/d +1)") 

(v) @/( +1)") 

(vi) (n!/(1 + i)”) 

2. For what values of z € C does each of the following sequences converge? 

(i) @") 

(li) (2"/n) 

(iii) (n!z") 

(iv) (z”/n!) 

(v) (2"/ n*) where k is a positive integer. 

(vi) (a(a— 1)---(a—n-+1)z"/n!) where a is a fixed complex number. 


3. Let a € R have decimal expansion 
A= d0.a{a...dn... 


where each a, is an integer and 0 < a; <9 for n > 1. Find all values of a for which 
the sequence (a,) converges. 


4. Let z, = ii" + (-i)”). Write down the first few terms of the sequence (z,,). Derive 
similar expressions for the nth terms of the sequences that begin as follows: 


Gi): 40,0,/0,1,0,0;,.0;1,0:0,0;1,0,0, 0,5; 
(ii) 1,0,0,0, —1,0,0,0, 1,0, 0,0, —-1,0,0,0, ... 
Gii) 1,0,-—5,0, $,0,—7,0, ... 

(iv) 1,0, 15,0, 3,0, 13,0, ,0,12,0, ... 
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5. Let (u,) be a convergent sequence in C. Let v, = oem u,)/n. By writing v, = 
v), + vi where 


v= = Sou wae >. Ur 


r<J/n Jn<r<n 
show that (v,) converges to the same limit as (u,). 


6. Find the radius of convergence of the following series: 


(i) Doz"/n 
(ii) }> z”/n! 
(iti) S° n!z” 
(iv) }° nz” where kis a positive integer. 
Wee 
7. The Oth order Bessel function Jo(z) is defined by the power series 
ee) 1 n 
Jo) = Le ener 


Find its radius of convergence. 


8. Find the radius of convergence of the following series: 
3 5 7 


‘ Zz Zz z 
O27 TF 
2 4 6 
Nie v4 Zz Zz 
tor at 6 
2 3 4 
baa Zz L 
(iii) z ae 
| a2 fi<acla= 1 
(iv) 1+ az le Om cs ) - ial 1) 42 cen gee 


(Note that in part (iv) the radius of Gonvenenee may differ for different values of a.) 
9. Show that )°7°, 2" converges for |z| < 1, but diverges for infinitely many z with 
z| = 1. 
10. Suppose that )° az” has radius of convergence R and let C be the circle {z € C: 
|z| = R}. Prove or disprove the following (which may or may not be true): 


(i) If }° a,z”" converges at some point on C then it converges everywhere on C. 
(ii) If }* anz” converges absolutely at some point on C then it converges absolutely 
everywhere on C. 
(iii) If }° a,z" converges at every point on C, except possibly one, then it converges 
absolutely everywhere on C. 
(Hint: the series }* z”/n could prove useful in this question.) 


11. If > ayz" has radius of convergence R, use the formula 1/R = limsup lan|!/” to 
find the radius of convergence of: 


(i) oe nw Anz" 
(ii) >> anor 
(iii) )> az" 
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12. 


13. 


14. 


15. 


Prove that if each of the series }a,z",>°b,z', and }\a,b,z" has radius of 
convergence equal to 1, then so have the series }° an bez" and )> a2 bz". 


For |a,| < 1, show that }>a,z" is absolute convergent for all |z| < 1. If }> a,z" = 
F(z) for |a,| < 1, |z| < 1, show that 


1 
If(| < ia 


Prove that, for z 4 1, 


ee < 1 eh og {—4 
Y2-55(Sam as *] 


n=1 n=1 n=1 


Show that the series }“P°., z”/n and S~P°, z”/(n(n + 1)) have radius of convergence 
1; that the latter series converges everywhere on |z| = 1, while the former series 
converges everywhere on |z| = 1 except z= 1. 


Suppose that the power series )*?° 9 daz” has a recurring sequence of coefficients; 
that is, dn4x = an for some fixed positive integer k and all n. Prove that the series 
converges for |z| < 1 to a rational function p(z)/q(z) where p, g are polynomials, 
and that the roots of g are all on the unit circle. 

What happens if an44 = Gn/k instead? 


4.1 


Differentiation 


The derivative of a real function is defined by a limiting process, which generalises 
without difficulty to complex functions now that we have developed the necessary con- 
cept of a limit. There are few surprises in this chapter: the results on differentiation of 
sums, products, composites of functions, and power series parallel the real case — and 
even the proofs are essentially unchanged. One minor surprise is that the condition of 
differentiability implies certain relations between the real and imaginary parts of a com- 
plex function, specified by the Cauchy—Riemann Equations. We apply these equations 
immediately to prove that a function on a domain has zero derivative, it must be constant. 
While the result is analogous to the real case, the proof is not. The idea of differentiation 
can also be extended to hybrid functions R > C and C — R, still without surprises. 

To counter the impression that all the results of real analysis go through without 
change to the complex case, the final section previews a dramatic difference between 
the two theories: every differentiable complex function can be differentiated arbitrarily 
many times. 


Basic Results 


By analogy with the real case we state: 


DEFINITION 4.1. (a) A complex function f defined on an open set S is differentiable 
at zo € Sif 
_ £(@) — f(z) 
lim ———— 
> %H Z— Zo 
exists. 
(b) The value of this limit is then defined to be the derivative f'(zo) at that point. 
(c) Iff is differentiable at every zo € S thenf is differentiable. In this case the derivative 
can also be considered as a function f’ : S > C. 
(d) If f’(z) is also differentiable, we define the second derivative to be 


f f'@ —f' @o) 
im ————_ 
=H Z-%H 


7 oS : 


(e) Repeating this process we obtain the usual notion of higher derivatives f” (zo), 
Pas ea fa) at a: 
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Example 4.2. Suppose f(z) = z?. Then 


ee 
tim LO =O) _ jig 27H _ 220 
Z>20 Z— ZO 2720 Z — ZO 


Hence f’(zo) = 2zo for all z € C. Similarly f’”(zo) = 2 and f™(zo) = 0 for all 
n> 3. 


Several alternative notations are used of differentiation, the two most popular being 
Df (z) or df(z)/dz in place of f’(z). The second derivatives are then denoted by D2f(z) 
and df (z) / dz” respectively. Sometimes (especially in classical texts) f(z) is denoted by 
w and its derivative by dw/dz. In this text we use the notation f’(z) because, when we 
think of the derivative as a function in its own right, we can then denote it by f’. The 
same goes for higher derivatives f” and so on. 

In many ways, results about complex differentiation follow naturally by analogy with 
the real case, as the next few results illustrate. 


PROPOSITION 4.3. Iff is differentiable at zo, then f is continuous at Zo. 
Proof. 
(Z— zo) =f’ (Zo) -0=0 


f(z) —f Zo) 


lim f(z) — f(Zo) = lim 
l>% 27H £— £0 


Thus differentiability at zo implies that 
lim f(z) = f(Zo) 
Z—Z0 


which is one formulation of continuity. 


Recall from Section 2.3 that the sum f+ g, difference f — g, product f- g, and quotient 
J/g of two functions f : S > C,g : S — C are defined in the usual manner: 
F+HNO=fz+s@) ES) 
F-gW=f@—-—se) eS) 
(f g@=f@s2 (eS) 
(f/8)Z) =f@/s@) j@eS,g@) #0) 


We obtain the expected rules for differentiation of these combinations: 


PROPOSITION 4.4. Iff and g are differentiable at zo, then so are f + g8,f —g,f-g, and 
I /g (in the latter case, provided that g(zo) # 0). The derivatives are: 


(i) F+9) =f +28’ 

(ii) ff — 9) =f'-8' 
(iii) (f- 9)’ =f-e'+f'-g 
(iv) (f/8) = (f'g —fa’)/(g”) 


Proof. The computations are analogous to the real case. As an example, we prove (iii): 
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f(D 82) —f(zo)8(Z) 


(f+ 8)'@o) = lim 


— 20 Z— ZO 

iy f(Os@) — f@sg (zo) +f @sgZo) — f(zo)g(zo) 
z>20 Z— 2% 

= lim f(2 lim g(z) — (Zo) ee f(2) — fo) 
Z>2% 220 £ — X%o I> XH £ — Xo 


using the algebra of limits, which equals 


f (20) 8 (Zo) +f’ (Zo) 8 (Zo) 


because Proposition 4.3 implies that f is continuous at Zo. 
Parts (i, ii, iv) can be proved by similar methods. 


COROLLARY 4.5. (i) Let 
P@) = ay + a Z++++ + ay2" 


be a polynomial with complex coefficients a,. Then its derivative is 


p (2) =a, + 2anz +--+ +nayz”! 


(ii) Let p(z), q(z) be polynomials over C. Then 


d p(z) _ p'(z)q(z) — p(z)q'(z) 
dz q(z) q(z)? 


whenever q(z) # 0. 


Proof. We prove by induction that 


d n n—-1 
— 7? — 4.1 
dee nz (4.1) 
Everything else then follows from Proposition 4.4. 
Clearly (4.1) is valid for n = 0. When n = 1, let f(z) = z. Then 


Z— £0 


f'(zo) = lim =i 


I> 2 Z— ZO 


and again (4.1) holds. For the induction step, part (11) of Proposition 4.4 yields: 


d n+l d n —l n 
= 7) ‘ -l=(n4+1 
Zz re (z" -z) = nz" Z+Z (n Zz 


As usual, we denote the composition of f : S + C and g: T > C, where f(S) C T, 
by gof: 
sof) = gf) 
We then obtain: 


PROPOSITION 4.6 (Chain Rule). If f is differentiable at zo and g is differentiable at 
F(Z), then g of is differentiable at zo and 


(g of)'(zo) = g(f(zo))f (zo) 
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Differentiation 


Before giving a proof, consider a common attempt to prove the chain rule, by writing 


g(f(z)) — s(fo)) — gf) — 8(f@o)) f@ — fo) 
Z—% F(Z) —f Zo) Z— £0 
provided that f(z) ~ f(z). Since f is differentiable at zp it is continuous there, so z > Zp 
implies f(z) > f (Zo), giving 
im S6f@) = 8F@o)) _ 
22 f(z) —f(Zo) 
Letting z > zo in (3.1) then gives (g o f)'(zo) = g'(f(zo))f’ (zo) as required. 
Unfortunately this attempt has a nasty gap, because f(z) — f(z) could be zero. If we 
knew it was non-zero in some neighbourhood of zp, excluding z = zg, we could work 


(4.2) 


g'(f (Zo)) 


in this neighbourhood and the proof would then be valid. In fact Theorem 10.16 below 
shows that in the complex case such a neighbourhood always exists — an interesting 
example of the extra simplicity of complex analysis. In the real case, no such theorem 
holds. 

However, the proof commonly used in the real case to patch up the gap in the 
above attempt also works in the complex case, and is much more elementary than 
Theorem 10.16. It goes like this: 


Proof. Let u = f(zo). Define 


hw) = —g(u) ifwAu 


h(u) = 0 


g(w) — gu) 
i 


Clearly 4 is continuous and defined for all w sufficiently near u. Also, as z > zg we see 
that h o f(z) > ACf(z)) = hu) = 0. Now, the definition of h, applied when w = f(z), 
can be written in the shape 


g(f(z)) — gw) = (A(f(2)) + 8(W))(F) — u) 


when f(z) 4 u. But it is also obviously true when f(z) = u. Let z # z, divide by both 
sides by z — Zo, let z > zo... Voila! 


The Cauchy—Riemann Equations 


Suppose that we write a complex function f in terms of two real functions u,v of two 
real variables 


f(@) = u(x, y) + v(x, y) 


where x + iy = z. We now prove that differentiability of f imposes two conditions on 
the partial derivatives of u and v with respect to x and y. We use the notation 


u(x + h,y) = u(x, y) 


0 Me )= Ii 
x,y) = lim 
Ox * h 


—>0 h 
ee, = lim u(x, y + k) _ u(x, y) 
dy k—0 k 
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and abbreviate these to du/dx and du/dy when no confusion can occur. We can then 
prove: 


THEOREM 4.7 (Cauchy—Riemann Equations). If f is differentiable at z = x + iy then 
du/dx, du/dy, dv/dx, dv/dy all exist at (x,y), and 
du ov ov ou 


= = 4, 
ox dy ox oy sen 


Proof. We calculate f’(z) in two different ways. First we take a point near z in the form 
zth=(«+h)+1), where h is real, and compute 


f(zt+h —f@ 


/ = ij 
f@ 10 h 
ee u(x + h,y) + iv(x +h, y) — u(x, y) — iv, y) 
h-0 h 
. u(x + h,y) — u(x, y) a y's v(x + h, y) — v(x, y) 
= lim +i lim 
h>0 h h->0 h 
du dv 
— 1— 
Ox ox 


Next we take a point near zin the form z = x + i(y + k), where k is real, and compute 


u(x,y +k) + iv, y + k) — u(x, y) — iva, y) 
h 


/ = li 
e= fm 
ov .ou 
= — —-i— 


~ ay ay 


Now equate real and imaginary parts in the two expressions. 


The equations (4.3) are called the Cauchy—Riemann Equations after Cauchy (1789- 
1852) and Riemann (1826-1866). However, they were known to d’Alembert in 1752. 
Observe that one equation has a minus sign but the other does not. This happens because 
i? = —1anditis an important feature of the Cauchy—Riemann Equations. We can check 
this using the simplest nonlinear complex function: 


Example 4.8. Suppose that f(z) = 2” = (° — y*) + 2ixy, so that 


u(x,y)= x —y v(x,y) = 2xy 


Then 

Ou 
oe Oy 
Ox 

du 5 
ay y 
ae 
ax - 
Lae 
a 


and the Cauchy—Riemann Equations are valid. 
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Example 4.9. The function f(z) = rez is continuous everywhere but differentiable 
nowhere. In this case u(x,y) = x, v(x, y) = 0. So du/dx = 1,dv/dy = 0, which are 
never equal. Analogous real functions are much harder to find, see Section 4.6. 


Example 4.10. The converse of Theorem 4.7 is false. Let 


0 if one or both of x, y is zero 
1 if neither of x, y is zero 


farin={ 


For this function, the partial derivatives of u and v all exist at the origin and are zero, 
so the Cauchy—Riemann Equations are certainly satisfied. But f is not continuous at the 
origin, so it cannot be differentiable there. 


Once again the introduction of real analysis creates complications. However, in this 
case it is relatively easy to patch things up. It is rather like starting a well-tuned vintage 
car on an icy morning. Once it is going, it will run smoothly, but getting it started can 
take hard work. Complex analysis will prove to be a well-oiled machine, but taking 
real analysis as a point of departure requires paying careful attention to the starting 
conditions. If we do that, the machine runs well. But to verify this claim, we have to 
take a very close look at the mechanics. We begin with a technical lemma. 


LEMMA 4.11. If du/dx and du/dy exist at (x,y) and du/dx is continuous there, then 
ou ou 
u(x+h,y+k)—u(x,y)=h (Sey) (hk) +k (Fes Dr n(hk)) 
where e(h,k), n(h, k) > Oas h,k — 0. 
Proof. Write u(x +h,y +k) — u(x, y) as 


u(x +h,y +k) — ux,y +k) + u(x, y +k) — u(x, y) 


By the Mean Value Theorem for one real variable applied to (4) = ux +1, y +4), there 
exists 9 with 0 < 6 < 1 such that 


a) 
u(xth,y +k) — ux,y4 ky = h(x +t 0™h,y +k) 
x 


By continuity of du/dx, 
Ou 
ox 
where e(h,k) > Oas h, k — 0. Hence 


(x+ Oh, y +k) Oe = e(h,k) 
Ox 


u(xth,y +k)—utx,y+k) = h (oxy) + e.0)) (4.4) 


More simply, k —> 0 implies 


u(x, y +k) — u(x, y) Ou 
> 
k dy 


(x,y) 


Therefore if 


ih = u(x, y+ ° — u(x, y) aes y) 
y 
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then 
0 
u(x,y +k) — u(x,y) = k (Fes y) + nh, b) (4.5) 


where n(h,k) > Oash,k —> 0. 
Adding (4.4) and (4.5) gives the required result. 


By imposing extra continuity conditions on the partial derivatives of u and v we can 
prove a converse to Theorem 4.7: 


THEOREM 4.12. If f(z) = u(x, y)+ivQ, y) is a complex function defined on an open set 
S, and at some point 79 = xo + iyo the partial derivatives du/dx, du/dy, dv/dx, dv/dy 
all exist, are continuous, and satisfy 


Ou dv dv ou 


then f is differentiable at zo. 


Proof. Using Lemma 4.11, we can write 


S( — fo) = uo + h, vo +k) + iv(xo +h, vo + k) — u(x0, Yo) — iv(xo, yo) 


n(S ) «( ) n (> ) it (= ) 
= t+ E t t + 1 +E +1 t 
oe 1 by m1 ay + 2 dy n2 


where €1,£2,1,N2 > Oash,k > 0. 
By the Cauchy—Riemann Equations, 


: ou .dv 
f(2 — fo) = (AA in (> i ) he, + kn + he2 + kn2 


( ) Ou .ov 
= i 
ea ox ox ‘ 


where p = he; + kn, +he2 +kn2, so 


f@)—f@o) du av Pp 
Z—-wm Ox dx 2Z—-ZH 


when z ¥ zo. But 


p 
Z— 2% 


|p| 2 lAllex| + |All] + lAlle2| + 1k ln2| 
VIP +12 Vie + 
< ler] + Ii] + lel + || 


Let h,k — 0. Then |p|/(z— zo)| — 0, so 


_ f(a@—-f@) du dav 
lim = + 4 
7% 7-72 ox Ox 


as required. 


Example 4.13. The function f(z) = |z|” is differentiable at the origin and nowhere else, 
since 


ux,y=r+y vay) =0 
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Hence du/dx = 2x,du/dy = 2y,dv/dx = 0,dv/dy = 0, and the Cauchy—Riemann 
Equations are satisfied only at x = y = 0 at which point the partial derivatives are all 
continuous. 


Connected Sets and Differentiability 


If f(z) = constant, then f’(z) = 0, but what of the converse? When the derivative is zero, 
does this imply that the function is constant? The answer is obviously ‘no’ for a function 
defined on a disconnected set, But it is ‘yes’ when f is defined on a connected set. Recall 
that a domain is defined to be a connected open set. We now prove: 


THEOREM 4.14. If f is differentiable in a domain D and f'(z) = 0 for all x € D, then f 
is constant on D. 


Proof. By the proof of Theorem 4.7, f’(z) = du/dx +idv/dx = dv/dy = 0 — idu/dy. 
So f’(z) = 0 implies that all four partial derivatives are zero. 
From real analysis, if ¢’ = 0 on a closed interval [a, b], then @ is constant on [a, b]. 
If L = {t+ iyo: t € [a, b]} is a line segment in D, let (4) = u(t, yo). Then du/dx = 
¢’ = 0 so uw is constant on L. By a similar argument, u,v are both constant on any 
horizontal or vertical line segment in D. Hence uy, v are both constant on any step path in 
D. But any two points in D can be connected by a step path, so f is constant on D. 


The same technique proves: 


PROPOSITION 4.15. If f is differentiable in a domain D and any one of re f,imf, or 
| f| is constant, then f is constant. 


Proof. If f = u+ iv and ref = u is constant, then du/dx = du/dy = 0. The Cauchy— 
Riemann Equations then give dv/dx = dv/dy = 0, and by the argument of the previous 
proof, f is constant on D. The case when im f is constant is similar. 

If | f| is constant then uv? + v* = c for some c € C. If c = 0 then f = 0, so we may 
assume c # 0. Differentiating, 


Ou dv 
2 rd 2ye- =0 
ou dv 
ay Ts aie => 0 
and the Cauchy—Riemann Equations give 

ou ou 

ae = "ay =0 

Pid + Pil =0 
dy ox 


Add u times the first equation to v times the second, to get 


a 
(u? + yy =0 
Ox 
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Since uw + v? = c 4 0 we have du/dx = 0. Similarly, the other partial derivatives of 
u,v are Zero, SO u, v are real constants, and f is constant on D. 


Hybrid Functions 


At this point we briefly consider hybrid functions, by which we mean real-valued func- 
tions of a complex variable or complex-valued functions of a real variable. There are 
evident notions of differentiation in both cases. For instance, a real-valued function of a 
complex variable f : D > R where D is an open subset of C may be regarded merely as 
a complex function with imaginary part zero. Such a hybrid function is a very dull fel- 
low, for if it is differentiable, its constant imaginary part implies that it must be constant, 
by Proposition 4.15. 

We fare a little better with complex-valued functions of a real variable. The most 
interesting case is f : [a,b] — C, which (when continuous) is a path in the complex 
plane. Define the derivative at to € [a,b] to be 
a f() — f(to) 

> t— 
in the obvious way (and allowing appropriate one-sided limits at the end points a and b) 
we obtain the expected generalisations of the usual properties of the derivative: 


PROPOSITION4.16. Iff : [a,b] > C, g: [a,b] — C are differentiable at t € [a,b], 
then 
(fts’O=fO+t8'O 
(f:- 39’ O=fOKM+f' OsO 


(f/3)' 0 = F's —fOs'O)/(sOY 


The chain rule involving a complex function f of a real variable appears in two guises. 
We may either precede it by a real function h, or follow it by a complex function g, to 
obtain: 


PROPOSITION4.17. Ifh: [a,b] — [c,d], f : [a,b] — D, and g:D — C, then 
(f oh)'(s) =f’ (h(s))h'(s) 
(gofYO=e%(foO/s'oO 


wherever the derivatives on the right-hand side of the equations are defined. 


The proofs of Propositions 4.16 and 4.17 follow the same pattern as the real and 
complex cases. 

We end this section by characterising the geometric meaning of the derivative y’ of a 
smooth path y when y’ is non-zero. 


PROPOSITION 4.18. Ify : [a,b] > C is a path and the derivative y'(t) exists and is 
non-zero for some t € [a,b] (including the possibility of a one-sided derivative at either 
end point) then the tangent to y exists at y(t) and a point on the tangent is of the form 
y(t) + hy'(t) for any h € R. (Figure 4.1.) 
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y(t) = x(t) + y(t) 


Ax) 


~ 


vf) 


x(7) <x 


Figure 4.1 The tangent to a path y : [a,b] > C. 


Proof. Let y(t) = x(t) + iy(t), where x and y are real functions. Then y’(t) = x’(t) + 
iy’(y(t)). The parametric function f : [a,b] > R? given by f(t) = (x(t), y(2)) has a graph 
whose tangent is (x(t) + hx’(t), y(t) + hy’() for any h € R. This is the complex number 
x(t) + hx'(t) + i( Wt) + hy’) = xO + hx) + iO + hy’) = vO + hy". 


This proposition will be of value when we seek to visualise paths that are differen- 


tiable, particularly for complex integration in Chapter 6 and in later applications. 


45 Power Series 


More exciting creatures are power series, for they will later prove to be the foundation 
for all differentiable complex functions. 
By Corollary 4.5, the derivative of a polynomial 


P(z) = ag Farzt-+++anz" 
is the polynomial 


p (2) = a + 2aozt+ +--+ nayz"} 


This suggests that if f is a power series 


(Oa > az" 
n=0 


we should have 
CO 


f'@ = dj nage" 
n=0 
If this is true, we say that f can be differentiated term by term. When is this the case? 
Certainly f(z) must be convergent. The next two results show that this is almost suffi- 
cient: in fact, term-by-term differentiation is always valid within the disc of convergence 


of f(z). 
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LEMMA 4.19. Let f(z) = >> a,z" converge absolutely for |z| < R. Then g(z) = 
> nanz"—! converges for |z| < R. 


Proof. For |z| < R, choose r such that |z| < r < R. Then, by Theorem 3.25, > ayr" 
converges absolutely and there exists K € R such that |a,r”| < K for all n. Now q = 
|z|/r is less than 1, so 


|nanz”—*| = nlanl|z/r\"— 1"! 


K 
< mgt! 
r 


But for 0 < q < | the real series )* nKq"~' converges to K(1—q)~. By the comparison 
test, )- |nanz’—!| converges, so by Theorem 3.14 the series >> nanz"| converges. 


THEOREM 4.20. A power series f(z) = > anz" may be differentiated term by term 
within its disc of convergence. That is, 


F@ = Yi naz 


Proof. By Lemma 4.19, g(z) = > na,z"—! is absolutely convergent for |z| < R. We 
must show that for |zp| < R, 


ey ies (2 =f) oe 
ZZ Z— ZO 
or equivalently that 


Pe (2 — fo) - (a) 0 


Z>%H Z—Z0 


Taking our courage in both hands we compute 


£@) ae — g(%) = 2 (a, — = nay2) 


z & 


n=1 


since power series may be added and subtracted term by term, by Lemma 3.12. But the 
above expression can be written as 


[e,@) 
yl age ee ag 
n=1 


N 
—] —2 —1 —] 
— Y anle” +7202" “++: +2 — NZ ] 


n=1 


e,) 
+ 5 angle +202" 7 +20 $7 = ng! 
n=N+1 
= 2X] + Xp, say 
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Given any ¢ > 0, we first choose any r such that |zo| < r < R. Then }> na,r'! is 
convergent and, as in Lemma 3.6, there exists N = N(e) such that 


[e,2) 


> |nanr")| < €/4 


n=N+1 
Since |zo| < r, if z is close enough to zg to ensure that |z| < r also, then 


[o,@) 
[Z21< > 2nlan|r*! < e/2 (4.6) 
n=N+1 


Furthermore, &1 is a polynomial in z so 41 — Oas z —> zo. We can therefore find 6 > 0 
such that 


|z — zo| < 6 implies |X {| < ¢/2 (4.7) 
We now choose z close enough to zo to ensure that both (4.6) and (4.7) hold, and then 


| f(z) — f(z) 


Z— ZO 


=€E 


| € 
g(zo)| = |214 al S [2+ eal = 5 + 


Therefore f’(zo) = g(zo) as claimed. 


Theorem 4.20 is immensely important, for it tells us not only about the first derivative 
of a power series, but about all higher derivatives as well. We just repeat it as often as 
we want to get: 


COROLLARY 4.21. All the higher derivatives f', f’,f",..., ©, ... of a power series 
f() = > ayz exists for z within the disc of convergence, and 


f° @ = Yinn=1)--- (nk + Vane 
n=k 
_ 3 n! n—k 
~ (n= Pe 


Proof. Use induction on k. 


Replacing z by z —Z in Corollary 4.21, we find that if a power series f(z) = > a,(z— 
zo)” has disc of convergence |z — zo| < R, then inside this disc of convergence, all the 
higher derivatives of f exist, and 


[o,2) 


! 
o>, Ge “ pine ay 


n=k 


Putting z = zo in this series we get 
f Go) = Mag 


which gives yet another important corollary: 
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COROLLARY 4.22 (Taylor’s Theorem). [f f(z) = )> a,(z — Zo)” for |z — zo| < R, then 


f (zo) 
a= 
k! 


We can thus express f(z) as a convergent Taylor series 


oO ¢(n) 
{@= ye (|z — zo| < R) 


n=0 


Example 4.23. Suppose that f(z) = 1/1 —2 = 14+z+2 +--+ for |Z < 1. 
Differentiating, we find that 


“o\) = ! = 24, 
f@ a ia 2z+ 3z° 4 


(2) = Seg hig?) 
f =a Z Zope 


and so on. We also have f (0) = n! and f(z) = )~ f™(0)z"/n as expected. 


A Glimpse Into the Future 


We now set the scene for an area where complex analysis is much better behaved 
than real analysis: differentiability. A real function may be differentiable but have a 
non-differentiable derivative. By repeated integration and the Fundamental Theorem 
of Calculus, it follows that for any n there exist real functions that are differen- 
tiable n times but not n + 1 times. There also exist continuous real functions that 
are differentiable nowhere, such as the ‘blancmange function’ described later in this 
section. 

Complex functions are just as flexible as real ones when it comes to continuity, but the 
condition of differentiability is much more stringent in the complex case. Constructing a 
continuous complex function that is differentiable nowhere is trivial: a simple example 
is the real part f(z) = rez, as we proved in Example 4.9. In compensation, we prove in 
Chapter 10 that if a complex function is differentiable in a domain D, then it is differ- 
entiable any number of times in D, and it even has a convergent power series expansion 
near any point in D. 


Real Functions Differentiable Only Finitely Many Times 


In the real case, for any n € N, there exist functions that are differentiable n times but 
notn-+ 1 times. A simple example when n = 1 is 


0 («<0) 
x (x>0) 


P(x) = 
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Differentiation 


Trivially, 
Axe) © G20) 
sa =| 2x (x > 0) 
and an easy calculation gives 
— (0 
$0) = tim $= HO _ 9 
x0 x 


Thus ¢’ exists, and is even continuous at 0. But #”(0) does not exist because 


$()- 90 [0 @<0) 
x ~ | 2 @>0) 


More generally, the function 


_ J 0 (x < 0) 
oo ={ x*+1 (x > 0) 


is differentiable n times everywhere, but not n + | times at the origin. 

We shall see later that there is no similar method for piecing together complex func- 
tions to obtain this type of behaviour. For a real function, there are only two ways to 
approach a limit point xo: from the left (smaller values of x) and from the right (larger 
values of x). We can piece real functions together quite happily, provided we make sure 
that the left and right derivatives are the same to whatever extent we deem necessary. 


Bad Behaviour of Real Taylor Series 


Indeed, we can go further. Above, we pieced together 0 on the left and x”*! on the right 


to obtain a function that can be differentiated n times but not n + 1 times. But we can 
even find functions like 


_ J 0 (x < 0) 


which has all derivatives in agreement at the origin, when approached either from left or 
right. This ‘Frankenstein’s monster’ creation is patched up well, but there is something 


unnatural about it. Because all derivatives are zero at the origin, its Taylor series about 
the origin is 


OO p(n) 
~ a rer + Ox? 4 + Ox” 4 =0 
a n! 


This converges for all real x, but its sum is not equal to (x). 

Thus in the real case we can find infinitely differentiable functions that are not equal 
to their Taylor series. Despite our reference to Frankenstein, there is nothing mysterious 
about this. It just means that the remainder term R,(x) in the Taylor series 


P(x) = ay + ayx + agx* +--+ +.a,x" + R(x) 


does not tend to zero as n tends to oo. Indeed, here R,(x) = (x) for all n. (In the 
theory of differentiable manifolds, the ability to patch real functions together smoothly 
like this is actually very useful. It provides topological flexibility. In this context, the 
complex case really is more complicated.) 


4.6.3 
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The Blancmange function 


Real analysis has even more grey areas, inhabited by functions that are continuous every- 
where yet differentiable nowhere. Bernard Bolzano found such a function around 1831, 
but it was not published until 1922. Charles Cellérier discovered one around 1860, which 
was published in 1890, shortly after his death. The first such function to be published 
was constructed by Weierstrass in 1872, as the sum 


CO 
f(xiy= > a” cos(b" 1x) 

n=0 
where 0 < a < 1,b is a positive odd integer, and ab > 1 + 37/2. The smallest 
value of b satisfying these conditions is b = 7. Weierstrass’s function is continuous 
by uniform continuity of the terms in the sum. Term-by-term differentiation leads to 
a series that does not converge, and Weierstrass gave a rigorous proof that the func- 
tion is nowhere differentiable. Later Godfrey Harold Hardy improved the conditions to 
0 <a < l,ab > 1. Cellérier’s example is very similar to Weierstrass’s. Figure 4.2 
shows how irregular this function is. 

We describe a simpler example that goes by the name of the blancmange function. 
Let G(x) be the distance from x € R to the nearest integer. This has a graph like the 
teeth of a saw: 

x (0 <x<}) 
l-x (§<x <1) 


c= { 


and G is periodic with G@ +n) = G(x) for any integer n. It is not differentiable at 
x= hn for any integer n. Moreover, |G(x) < 5 for all x € R. It follows that the 
function G,,(x) = (4)"G(4"x) is not differentiable at x = $(5)™ for any integer m, and 
satisfies 0 < G,(x) < ray 

The sum 


b(x) = )) Gala) 
n=0 


is sketched in Figure 4.3, which shows why it is called the blancmange function. 


Figure 4.2 Weierstrass’s continuous nowhere differentiable function. 
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Figure 4.3 Top left: Successive halvings of the sawtooth function. Bottom left: Adding the 
sawtooth functions gives successive stages of the blancmange function. Right: An advanced 
stage of the construction. We show only the unit interval; the full function is periodic with period 
1 and defined on the whole of R. 


Intuitively, the curve is a fractal, a set with detailed structure on all scales of mag- 
nification, Mandelbrot [13] and Falconer [4]. This is clear because the graph of the 
blancmange function has smaller blancmanges everywhere. Some are clearly recognis- 
able, while others are disguised by being sheared vertically as they stand on straight line 
segments that lie at an angle. In general, there are two kinds of points on the graph: 
those at dyadic rationals x = k2~” for integers k, n with n > 0, where the graph comes 
down vertically from the left and goes up vertically to the right. Here, under high mag- 
nification the graph looks like a half-line pointing upwards. Everywhere else, the graph 
is locally a tiny blancmange, sheared on a small line segment. This is easy to see when 
the segment is not too steep, but not when it is nearly vertical. 

A rigorous proof requires extra effort, and we sketch one now: 


PROPOSITION 4.24. The blancmange function is continuous everywhere and differen- 
tiable nowhere. 


Proof. The estimate 


ia L 
0=b@) =) r= 77 =3 


T 

n=0 4 

holds, so by uniform continuity, as in Section 2.9 on space-filling curves, bis continuous 
everywhere. 

To prove it is differentiable nowhere, observe that in order for b to be differentiable 
at a, the expression y(x) = (b(x) — b(@))/(x — a) must tend to a limit. We construct a 
sequence (a) — a such that y(a,) does not converge. 

Under each line segment of G,, there are two identical teeth of G,+1 of length Gr ; 
so we can finda, =a + Gy such that 


Gin(Qn) = Gn(@) (m=n+1) 
The gradient of the straight bit of the tooth of G, and larger teeth is 


Gin(@n) — Gin(o) - 


An — a 


1 (m<n) 
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Hence 


D(a) — = cnt Gin(@) 
An — = sy —a 


is asum of n+ | terms, each +1. Now y(q,) is an odd integer when n is even and an 


y(n) = 


even integer when n is odd. Therefore (y(a@,)) cannot tend to a limit. 


Having created the bad function b, its antiderivative 


bi) = ; : b(t)dt 
0 


is differentiable once everywhere (with derivative b) but is twice differentiable nowhere. 
Repeating this process inductively we obtain a function b,, that is differentiable n times 
everywhere (with derivative b,_1) but is (2 + 1) times differentiable nowhere. 

Looking back at the idea of a space-filling curve, we begin to see why our intuitive 
ideas of continuity and differentiability need modifying to fit with the formal epsilon- 
delta definitions of continuity and limits. The space-filling curve is constructed as a 
formal limit of curves made up of straight line segments, and in the construction we 
take successively smaller line segments that change direction. In the case of the blanc- 
mange function, successive curves change direction at dyadic rationals of the form k2~” 
where k,n are integers and n > 0. The limit function is continuous everywhere and 
differentiable nowhere, with the slope at the dyadic rationals being vertically down to 
the left and vertically up to the right. The space-filling path constructed here is the limit 
of graphs whose real and imaginary parts also turn suddenly at dyadic rationals (Fig- 
ure 2.26) yet the distance travelled in each step gets smaller and smaller (Figure 2.27). 
This is how it manages to go through every point in the unit square. 

Our strategy to deal with this problem is simple. When we calculate integrals in com- 
plex analysis, we use ‘contours’ made up of a finite number of ‘smooth paths’, which 
have non-zero continuous derivatives to ensure that the formal approach resonates with 
our intuitive idea of tracing the contour in real time using a finger. 


Complex Analysis is Better Behaved 


Real analysis is a very hairy subject indeed. But what is the relevance of such bizarre 
functions in complex analysis? The answer is: NONE WHATSOEVER. They have been 
mentioned only to contrast the real case with the complex case. 

There do, however, exist very simple complex functions that are continuous but differ- 
entiable nowhere. An example is f(z) = |z|, whose continuity is easily proved, but fails 
to satisfy the Cauchy—Riemann Equations at any point. There are also functions, such 
as f(z) = |z\*, that are differentiable only at isolated points (here, the origin), and are 
thus not differentiable twice at those points. But that is the end of the line. If a complex 
function is differentiable once in a domain, then (as we prove in Chapter 10) it is differ- 
entiable any number of times, has a convergent Taylor series, and is equal to its Taylor 
series. The reason for this much nicer behaviour is that in computing the derivative 
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f'@= lim f(2) — F(Z) 


zm 82 —Z0 


z can approach zo from any direction in the complex plane. The existence of the limit 
is therefore a much stronger condition than it is in the real case, and precludes patching 
together different functions at zo. 

The key to the whole theory of complex analysis is the fact that every differentiable 
complex function has a power series expansion near any point of its domain, so is equal 
to its own Taylor series as in Corollary 4.22. This will be established by a roundabout 
route in Chapter 10, but it is worth waiting for, and it underlines our emphasis on power 
series. They are not just good examples of differentiable functions: in a very genuine 
sense they are the only examples. 


Exercises 


1. From first principles, differentiate: 
@) f@=242z 
(i) f(@) = 1/z(z #0) 
Gil) fQ =P +2 
2. Show that f(z) = |z| is continuous everywhere and differentiable nowhere. Show 
that f(z) = |z|? is continuous everywhere and differentiable at the origin but 


nowhere else. 
3. Differentiate: 
@) 322 +22 
(ii) (2 + 3iz — 4)(23 — Ti) 
(iti) (2° — 13iz?)? 
(iv) (22 + 3)9(z4 + (26 — 11i)z)!” 
(v) (22 +3)/(4z3 +5 — 3i)? 
n 
(vi) (=) neNzeén 
—n 


z 


4. Let 
Zz n 


In(2) = (1+ ) 


= 
fOzPS (* *) 
n 


What do you notice as n > oo? 


Show that 


5. Let Cy, = {2 € C:z4-x€ R,x < 0 be the ‘cut plane’ with the negative real 
axis removed. Prove that C,, is a domain. Define r : C, — C by (r(z))? = z and 
rer(z) > 0. Prove that r is continuous on C,,, and hence show from first principles 
that r’(z) = 1/(2r(2)). 

6. Let f(z) be a polynomial in z € C. Prove that the function g(z) = f(z) is dif- 
ferentiable everywhere, but that A(z) = f(z) is differentiable at 0 if and only if 
f'(0) =0. 


7. 


10. 


11. 


12. 
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In each of the following cases, for f defined on the domain D, find explicit formulas 
for u(x, y), v(x, y) where f(z) = u(x, y) + iv(, y), where z = x+ iy and all of x, y,u,v 
are real. 

@) f@=1/z, D={zeC:zF40} 

(i) f(z) = |zl, D=C 
(iii) f(z) =z, D=C 
Show that u,v satisfy the Cauchy—Riemann Equations everywhere in (i) and 
nowhere in (il), (iil). 


. Verify the Cauchy—Riemann Equations for the functions u(x, y), vx, y) defined in 


the given domains by 
(i) u(x, y) =x? — 3xy’, vx, y) = 3x’y — y? (Ay € R) 
(ii) u(x, y) = sinx coshy, v(x, y) = cos xsinh y (x,y € R) 
(iii) u(x, y) =x/@? +9), (xy) = —y/? +), O? ty? #0) 
(iv) u(x, y) = } log? + y?), vay) = sin” (y/V/x2 + y2) (x > 0) 
In each case, state a complex function whose real and imaginary parts are u(x, y) 
and v(x, y). 


. For z= x+y, let 


3 . 3 . 
xd+ip)p-yd- a) 
= 
# ety 
f(0) = 0 
Show that f is continuous at the origin, the Cauchy—Riemann Equations are satisfied 
there, yet f’(0) does not exist. Why does this not contradict Theorem 4.12? 


Let f(z) = VJ|xy| for z = x + iy. Show that the Cauchy—Riemann Equations 
are satisfied at the origin yet f’(0) does not exist. Why does this not contradict 
Theorem 4.12? 


Consider 


(z #0) 


+1 . 
fo) = 2 (Z=x+iy #0) 


FO) =0 


Verify that lim,.,,(f(z) — zo)/(z — zo) = 0 as z — O along any straight line, 
z= (a+ ib)t,t € R. This does not prove that f’(0) = 0, however. By considering 
z — 0 along the path z(t) = # + it, show that f is not differentiable at 0. (This 
shows that when computing /’ it is not enough to consider a limit taken along certain 
specific paths or types of path. An entire neighbourhood of the point concerned must 
be considered.) 


For each of the following, compute f’(y(1)), y(t), (fv) and verify directly that 
(fv O =fY'Oyv'O 


@ fO=2,yH=P +if (EC re [0,1] 
(i) f(z) = 1/z, y@) = cost +isint (z £0,t € [0, 277]) 
Git) f@=1lt+z+2+---,yQ=t+ iP (\2| < 1,t€ [0, 5) 
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13. 


14. 


15. 


16. 


Suppose that f(z) = )°a,z” is convergent for all z € C and satisfies f/ = f and 
f() = 1. Find a, for all n > 0. Consider the derivative of g where 


8) =fle — OF) 
for c € C and deduce that 
fla+b)=f(a +f) 
for all a,b € C. Compute f(1) to five decimal places. (A calculator or computer 
may be used, but electronic assistance is not necessary.) 


Show that for all a € C the power series 


[e,@) 


fai = ye ene 


n! 


n=1 
converges for |z| < 1, and that in this domain its derivative is wfa(z)/(1 + 2). 
Is the radius of convergence | for alla € C? 
By differentiating fo (z)fp(z)/fa+p(z) or otherwise, show that 


fu+p@) = falafp (2) 


Deduce that for |z| < 1, 
fr@ = (1+ 2)" 


for every integer n (positive or negative), and that 


(fin())" = (1. +2) 


for all positive integers n. 

(This power series can be used to define (1 + z)* fora € C and any z in the 
domain |z| < 1.) 
Assume that the two power series s(z) = )0 anz” and c(z) = >> bnz" are convergent 
for all z € C, and that they satisfy the relations s’(z) = c(z),c’(z) = —s(z). Deduce 
the identities 


dy = —an2f/(Nn—1)) bn = —an-2/(n(n — 1)) 


If further s(0) = 0,c(O) = 1, determine s(z) and c(z) completely. By differentiation, 
or otherwise, prove that 


(c(2) + (s@))” = 1 
For a positive integer n the Bessel function of order n is defined by 
le) i= Vvoo 


I=) rln+n! 


r=0 


Show that this converges for all complex z and satisfies the differential equation 


d?y 
aire Pea (2 —n*)y =0 


17. 
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Verify the following: 

@) Jn—1(2) + Jngi@) = #In(2) 
Gi) JZ) = Sdn) — Ing @) 
(iii) Jn@) = 7Vn-1@ — Ing) 
(iv) Ji,(2) = In—1@) — 2Jn(@) 

(v) &(2"Jn(z)) = 2"Jn—1@) 

(vi) J2(z) — Jo(z) = 249 (2) 
Define f : R > Rby 


_f0  @<0) 
fe) = et* (x> 0) 


Show that f is differentiable arbitrarily many times, and that f™(0) = 0 for all 


neN. 


5.1 


The Exponential Function 


An abstract theory of functions may be intellectually diverting, but it pays its way by 
its more concrete applications by defining specific functions of particular interest and 
establishing their properties. As a step in that direction we now discuss the complex 
versions of the usual (real) exponential and trigonometric functions exp, sin, and cos, 
defining them as power series. We also introduce complex versions of the usual related 
functions such as tan and cosec. From these definitions we develop some basic properties 
of these functions. Euler’s famous formula 


exp(i@) = cos@ +1isin0@ 


follows at once from these definitions, and shows that the fundamental function here is 
exp. All of the others may be defined very simply in terms of it. We also discuss complex 
generalisations of the hyperbolic functions, such as sinh, cosh, tanh, which again have 
simple definitions in terms of exp. 

Most of the material generalises directly from the real case, and will be presented in 
a compact form. In addition to deriving the standard formulas, we place some emphasis 
on convincing the reader that the power series do indeed represent the usual real func- 
tions, and that in particular the geometric interpretations of sin and cos in trigonometry 
are valid. We also derive further properties of these functions that are peculiar to the 
complex case, including their relation to hyperbolic functions. 

The complex version of the logarithm requires deeper analysis, and is held over to 
Chapter 7. 


The Exponential Function 


Definition 3.26 defines the exponential function for complex numbers as the power 
series 


which is absolutely convergent for all z € C. We may therefore differentiate term by 
term. The result is the same series, proving that 


d 
os exp Zi exp Z (5.2) 
dz 
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With a little ingenuity we can use (5.2) to prove the formula 


exp(z1 + 22) = exp(z1) exp(z2) (5.3) 
derived in a cumbersome way in Chapter 3. Consider 
S(Z) = exp) exp(c — z) 
for c € C. Differentiate: 
f'@) = exp'(z) exp(c — z) + exp(z) exp’(c — z) 


= exp(z) exp(c — z) + exp(z) exp(c — z)(—1) 
=0 


By Theorem 4.14 f is constant; the constant must equal f(0) = exp(c). So 
exp(Z) exp(c — Z) = exp(c) 


Put z= z1,¢ = z1 + z to obtain (5.3). 

We wish to use the customary notation e* for exp z. To avoid ambiguities we must 
show that when z is rational, say z = m/n, this agrees with the usual real exponential 
en/n — /em, We do this as follows. 

First, observe that exp(x) > 0 for all x € R. This is obvious from the power series 
when x > 0. But when x < 0 we know that exp(—x) exp(x) = exp(0) = 1, so exp(—x) = 
1/exp(x) > 0. Therefore the nth root of exp(x) has an unambiguous meaning for any 
positive integer n. 

Next, define the real number 


e = exp(1) = 2.718281... (5.4) 
by an easy computation using (5.1) with z = 1. Using (5.3) and induction on n we obtain 
exp(nz) = (exp(z))” 

for any positive integer n. Therefore 


exp(n) = (exp(1))” = e” 


Clearly 
exp(0) = 1 
Then 
exp() exp(—n) = exp(n — n) = exp(0) = 1 
so that 


exp(—n) = (exp(n))"' =e") =e" 
Now, for any rational m/n, (n > 0) we have 


(exp(m/n))" = exp(nm/n) = exp(m) = &” 
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so that 
exp(m/n) = (e")'/" = el" 
Thus the notation 
e° = exp(z) 


does not conflict with the standard notation for powers of e, so we may (and do) use it 
from now on. Equation (5.3) becomes 


el &2 — putz (5.5) 


Putting z = x + iy this gives 


erty = eel’ 


Here e* is the usual real exponential. So we know how e* behaves provided we also 
understand e'”. We study both of these in the next two sections. 


Real Exponentials and Logarithms 


We briefly recall some standard properties of e* when x is real. 

Clearly eX > 1+ x for x > 0, which implies that e* > 0 for x > 0, and e* + co 
as x — oo. Alsoe* = 1/e* for x < 0, so e* > O for all x € R ande* > 0 
as x —> —oo. The derivative of e* is e* > O so e* is monotonic increasing for 
all x. 

That is, e* defines a continuous strictly increasing function from R onto Rt = {x € 
R : x > O}. By the Intermediate Value Theorem e* has a continuous strictly increasing 
inverse function, defined to be the natural logarithm 


log: R* +R 
with the property 


yolsxs SS c= 2 


From (5.5) we obtain 
log(x1x2) = logx, + logx. (x), x2 > 0) (5.6) 


Let y = log x, yo = log xo (x, xo € R™). Since log is continuous, 


__ logx — log xo _. Yo 1 1 
lim ———————- = lim = 


Xxx xX — xO y>yo e” — 0 e0 x0 


Therefore 


ae oe: 
dx e x 
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Trigonometric Functions 


Definition 3.26 also defines the complex sine and cosine functions by the power series 


0° 2n 
A z 
cosz = Baas onl (5.7) 
0° : ntl 
sinz = ue" ari (5.8) 


We know these are absolutely convergent for all z € C. 
Putting —z for z in (5.7, 5.8) we see that cos is an even function and sin is an odd 
function. That is, 


cos(—z) = cos(z) 


sin(—z) = — sin(z) 
Also 
cos(0) = | 
sin(O) = 0 
Differentiating term by term, 
: (5.9) 
— cosz = sin : 
a: Zz Zz 
C2 
— sin z= —cosz (5.10) 
dz 


Term-by-term addition, as in Chapter 3, leads to Euler’s formula 
e” = cosz +isinz (5.11) 
Since (e%)" = e"” for any integer n, equation (5.11) implies De Moivre’s Formula 
(cos z +isinz)” = cosnz+isinnz (5.12) 


This may be used to obtain rapid derivations of formulas for cosn@ and sinn@ when 
@ ER, by equating real and imaginary parts of both sides (Exercise 6). 
Euler’s Formula has the famous corollary 


| (5.13) 


Formulas (5.11) and (5.13) seem surprising when we first encounter them, because there 
is no obvious connection between e and z. Historically, these special numbers arose in 
very different areas of mathematics — natural logarithms and the circumference of a 
circle. Moreover, it is hard to see why i, which arises from yet a third area, polynomial 
equations, should link them together. But in fact, there is a simple reason why these 
results hold, based on yet another area of mathematics: differential equations, applied to 
uniform rotation in the plane. We discuss this in Section 5.6. If anything, this dynamic 


100 


5.4 


The Exponential Function 


explanation adds to the beauty of these formulas by showing that they are at the core of 
a remarkable unification of distinct mathematical concepts. 
Replacing z by —z in (5.11) gives 
e = = cos(—z) +isin(—z) = cos z—isinz (5.14) 
From (5.11) and (5.14), 


cos z = h(e +e) (5.15) 
sinz = xe —e%) (5.16) 


From (5.15, 5.16), and (5.5), we obtain the usual addition formulas for sin and cos, now 
valid for all complex numbers 21, z2: 


sin(z, + 22) = sinz), cos z2 + cos z1 sinz2 (5.17) 
sin(z) — Z2) = Sinz) CoS Z2 — COS Z] SIN Z (5.18) 
cos(z} + Z2) = COs Z1 COS Z2 — Sinz, sinz2 (5.19) 
cos(z] — Z2) = cos Zz] COS z2 + sinz, sin z2 (5.20) 


Putting z) = z2 = z in (5.20) gives 


cos? z+ sin’ z= 1 (5.21) 


An Analytic Definition of x 


Historically the real number z was defined as the ratio of the circumference of a circle 
to its diameter, and only later did its importance for trigonometric functions emerge. We 
reverse the process, defining 7 analytically and eventually showing in Section 7.1 that 
our definition agrees with the geometric one. 

The idea is to define 7/2 as the first positive real solution of the equation cos x = 0. 
The problem is to show that there is such a thing. 

We know that cos and sin are continuous functions. Also, 


22 76 gan-2 gin 
COS) = Ea gt Geo pl 
2 gAn-2 
Sf9 hie 4n(4n — 1) —4 
3 Goo 
2 
<1-2+4+-<0 
5 


But cos(0) = 1. By the Intermediate Value Theorem, cos fo = 0 for some fp € (0, 2). 
Let k be the greatest lower bound of {t € R : t > 0,cost = 0}. By continuity, 


cosk =0 


By the definition of k, if 0 < x < k then cosx > 0. 
We define 


mw = 2k 
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Then the number z has been uniquely defined by the properties 7/2 > 0, cos(7/2) = 0 
and 0 < x < 2/2 implies cos x > 0. 

Since cos(2) < 0, it follows that 0 < a < 4. This is a crude estimate, and we improve 
it in Exercises 17 and 18 below. 


The Behaviour of Real Trigonometric Functions 


We know that cos x is positive for 0 < x < 2/2. Since a sinx = cosx, it follows that 
sin is strictly increasing on [0, 7/2]. Since 
Qt 200 
sin” —+cos —=1 
2 2 


and cos z/2 = 0, we must have sina/2 = +1. But since sin is increasing on [0, 2/2], 
we must have 


_ 1 
sin > = 
Now (5.18) implies that 
sin (5 — x) = cos x (5.22) 


Hence cos decreases monotonically from 1 to O in [0,7/2]. Using (5.17) 
and (5.19) repeatedly, we can deduce the behaviour of sin and cos in the intervals 
(7/2, 2], [2,32/2], and [37 /2, 27]. We obtain 


8 ‘ 
cos (— +x) = —sinx 
2 
_ (0 
sin{ — +x} =cosx 
2 
sin(w + x) = —sinx 


and so on. We tabulate the results in Table 5.1, where a 7 b means ‘strictly increasing 
from ato b’, and a \, b means ‘strictly decreasing from a to b’. 
From the table, 


cos 27 = | 
sin2az = 0 
Therefore 
cos(x + 27) = cos x cos 2m — sinxsin27 = cos x (5.23) 
sin(x + 27) = sinxcos 27 + cosxsin27 = sinx (5.24) 
leading to 


cos(x + 2nm) = cos x 


sin(x + 2nz) = sinx 


for all n € Z. So sin and cos are periodic with period 27. In particular the behaviour in 
the table repeats on each interval [2n7, (2n + 2)7]. 
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Table 5.1 Behaviour of sin and cos in intervals of length z /2. 


interval cos sin 

(0, 2/2] 1\,0 0/71 
[x /2, 1] 0o\-1 1\0 
[x, 32/2] -1/70 O\ -1 
[32 /2, 27] 071 -1 70 


In this way, purely formal considerations show that sinx and cos x have their usual 
geometric properties for all real x, at least in outline. The final remaining step is to 
identify the point cos 6 + isin@ = e! on the unit circle |z| = 1 with the point @ radians 
from | in an anticlockwise direction. We indicate two different approaches in Section 5.6 
and Exercise 8 in Chapter 6. We also discuss the topic in detail in Chapter 7. 

More precise computations of the values of these functions may be performed to any 
desired degree of accuracy. By inspection, it may be seen that for real x (positive or neg- 
ative) the terms of the power series for sin x and cos x always alternate in sign. From the 
standard theory for alternating real series, the sum of the first n terms alternately over- 
estimates and underestimates the actual limit. This lets us make very precise estimates 
of the trigonometric functions. 


Example 5.1. Compute e! to four decimal places. 
We have e! = cos(1) +isin(1). Now 
1 1 1 1 


i a AY | ar a 


Considering partial sums: 


cos(1) < 1 
1 
cos(1) > 1 — H 0.5 
1 1 : be 0.541 66 
cos(1) < oy Gee ae 
cos(1) > 1 : : be 0.54027... 
2! 4! 6! 
1 1 1 1 
cos(1) < 1 ata-ata = 0.54030... 
Therefore 
cos(1) = 0.5403 (to 4 decimal places) 
Similarly 
sin(1) = 0.8415 (to 4 decimal places) 
Therefore 


e! = 0.5403 + 0.8415i (to 4 decimal places) 
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Dynamic Explanation of Euler’s Formula 


Euler’s formula(s) linking e,i, 7 seem surprising, but the existence of such a link, and 
even the details of how it goes, can be deduced very naturally from standard ideas in 
the theory of differential equations. This section is intended as motivation, and to help 
explain why such a connection exists. It can be made rigorous by setting up the necessary 
ideas formally. It is not used later in the book and can be skipped. 

Consider the linear differential equation 


dz 
—=i eC 5.25 
ams (z ) (5.25) 
on C. Equation (5.2) and the chain rule show that a solution z(t) with initial condition 
z(0) = 1is 


z(t) = el 
This solution is unique since the difference w between any two solutions satisfies 


dw 

dt 
so w(f) is constant, and must equal w(0) = 0 

Geometrically, (5.25) corresponds to a point particle z(t) moving in the plane C, and 

it states that the velocity vector z’(f) is at right angles to the position z(f), and the speed 
is |z(t)|, Figure 5.1. (Here ’ indicate the time-derivative. The right angle arises from the 
factor i in the equation.) That is: the particle always moves at right angles to the line 
joining it to the origin. Intuitively, the particle must move along the unit circle. To verify 
this, we prove that z’(f) is always tangent to the unit circle S, or equivalently that z() € S 
for all ¢. The key point is that |z(t)|* is conserved; that is, it is constant. Compute: 


“ Iz(t)? = £020) =Z20nO +202) = Ge) + C)Cie™) =1-i1=0 


so |z(f)| is constant. Since it equals | at time 0, we know that Iz(t)|> = 1 for all t. 
Therefore z(t) always lies on the unit circle. 


punil speed 


Figure 5.1 Uniform motion round the unit circle. 
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The speed of motion is 


Ol =Ve2@20 = Vee = | 


so the particle moves with unit speed (anticlockwise). 

After time f, the particle therefore moves a distance ¢ along the unit circle in the 
anticlockwise direction; that is, through an angle of t radians. 

At time t = z is has gone a distance z round the unit circle — that is, halfway round. 
So it must lie at the point diametrically opposite the starting point z(0) = 1. This point 
is —1. So z(a) = —1; that is, e#” = —1, the miraculous formula. 

More generally, at time f its position on the unit circle makes angle ¢t radians with 
the real axis at the origin, so its Cartesian coordinates are (cost, sint). Interpreted as a 
complex number, this point is cos ¢ + isint. Therefore 


e’ =cost+isint 


and we recover Euler’s formula (5.11). This approach is closely related to Section 7.1. 


Complex Exponential and Trigonometric Functions are Periodic 


We are used to the real sine and cosine functions being periodic, with period 27, and 
we have proved that the same property holds for their complex generalisations. Euler’s 
formula shows that their periodicity is inherited by the complex exponential function, as 
we now describe. 


DEFINITION 5.2. A complex function f : S—> C has period p € Cif 


f@+p) =f@ (z,z+ 9 €S) 


Obviously, if p is a period of f, so is np for any positive integer n such that mp € S 
for m < n, and with similar conditions we can also take n negative. 
For the complex exponential, S = C, and 


e271 — cos2n7 + isin2x = 1 


SO 


tem _ eve27 =.l=e& 


Therefore 2771 is a period for exp. So is 2nzi for any n € Z. There are no others: 


PROPOSITION 5.3. The complex number p is a period of exp if and only if p = 2nmi 
for somen € Z. 


Proof. If p is a period then e? = e+? /e& = 1. If po —u+ivthen 


1 =e” = eWel” = e”(cosv +isiny) 


Taking the modulus, 
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1 = |e”||cosv + isin y| = e” 
By properties of the real exponential proved in Section 5.2, u = 0. So now 


cosv+isinv = 1 


Therefore cos v = 1, sinv = 0. By Table 5.1, v = 2nz, (n € Z). 


We also have: 


PROPOSITIONS.4. The exponential function e? is non-zero for any z € C. 


Proof. For any z € C, ee~* = e® = 1. 


We now investigate sin and cos for periodicity in C. Certainly 2nz is a period of both, 
because the proofs of (5.23) and (5.24) for real x also work for complex z. 


PROPOSITION 5.5. The complex number p is a period of sin or cos if and only if 
p = 2nzx for somen € Z. 


Proof. Since sing + 2/2) = cosz by (5.17), it follows that ¢ is a period for cos if and 
only if it is a period for sin. Then 


cos(z + p) = cosz 


sin(z + p) = sinz 
for all z € C. But then 


e+) — cos(z + p) + isin(g + p) = cosz +isinz =e” 


so ip is a period of exp. By Proposition 5.3, ip = 2nziforsomen € Z,so p = 2n7. 


We can also find the zeros of sin and cos: 
PROPOSITION 5.6. Let z €C. Then 
cos z = 0 if and only if z=(n+ 5) 
sinz = 0 if and only if z =n 
where n € Z. 


Proof. Since sin(z + 1/2) = cos z by (5.17), the second equation implies the first. Now 
sin z = 0 if and only if (e% — e~”)/2i = 0. This holds if and only if e“ — e~% = 0, that 
is, if and only if e*” = 1. So 2iz = 2nmi. Therefore z = nz as claimed. 


Other Trigonometric Functions 


IfzA@t 5) then cos z 4 0, so we may define 


sin Z 
tan z = —— (5.26) 
COSZ 


IfS={zeC:zA(nmt+ s\n € Z} then S is a domain, and tan: S > Cisa 
differentiable function. Its derivative is 
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d cos zt sin Z —sinz$ COSZ 
—tanz= 5) - 
dz cos’ z 
cos Zz: cosz — sinz : (— sinz) 
cos? z 
cos* z + sin? z 
a cos z 
=1+tan’z 
Similarly we define 
COS Z 
cotz= — (z#nzT) (5.27) 
sin z 
1 
secz= ——_ (A (n+ 5)m) (5.28) 
COS Z 
1 ; 
cosecz = —— (zg nz) (often also written csc z) (5.29) 
sin z 


These are all differentiable functions (on the stated domains) whose derivatives may 
again be calculated in the usual manner. All of the standard formulas relating these 
trigonometric functions may be deduced from properties of sin and cos, hence also apply 
for complex values of the variables. 
For example, using (5.17) and (5.19), we obtain 
tan z, + tan z2 
1 — tan z, tan z 


tan(z} + 22) = 


provided that z,,Z2,z, + Z2. € S. This implies that tan(z + 7) = tanz, so 7 is a period 
for tan. It is easy to see that the only periods of tan are nz forn € Z. 

The reader is encouraged to develop all of his or her favourite trigonometric formulas 
for the complex case, including the basic properties of cot, sec, and cosec. 


5.9 Hyperbolic Functions 


As in the real case, we define 
sinh z = h(e —e*) 
coshz = ne +e%) 
for z € C. Differentiating, 
a h h 
— sinh z = coshz 
dz 
d : 
— cosh z = sinhz 
dz 
Properties of the hyperbolic functions, analogous to those of trigonometric functions 


(such as addition formulas for sinh(z1 + z2)) follow either by direct computation, or by 
using the obvious identities 


siniz = isinhz 


cosiz = cosh z 
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For example, 


cosh? z — sinh? z = cos? 


iz — (—isin? iz) 
= cos? iz+ isin? iz 


=1 


The functions tanh, coth, sech, and cosech (or csch) are defined in the obvious way. We 
leave it to the reader to discover their properties, including zeros and periods. 

Hyperbolic functions occur in expressions for the real and imaginary parts of sin z 
and cos z. Thus let z = x + iy. Then 


sin z = sin(x + iy) 
= sinx cosiy + cos xsiniy 


= sinx coshy + icos x sinh y (5.30) 
Similarly 


cos z = cos xcosh y — isinx sinh y (5.31) 


Exercises 


1. Express the following in the form a + ib for real a, b: 
(i) exp(i) 
(ii) eo tin 
(iii) 1/ exp(2 + iz) 
2. Express the following in the form a + ib for real a, b: 
(i) sin() 
(ii) cos() 
(iii) sinh(i) 
(iv) cosh() 
(v) cos(z/4 —1) 
(vi) tan(1 +1) 
3. Differentiate the functions defined as follows: 
(i) exp(z* + 2z) 
(ii) 1/ exp(z) 
(iii) exp(*)/exp(z + 1) 
4. Differentiate the functions defined as follows: 
(i) tan(z’) 
(ii) sinh(z + 2) exp(z?) 
(iii) sin(z) cosh(z) exp(z) 
5. Use the identity e!%e'? = e+) to derive the usual formulas for cos(@ + @) and 
sin(O + @). By a similar method, show that 


1/(cos@ +isin9) = cos@ — isind 
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6. Use the identity (ei?) = e7!? to give the usual formulas for cos(30) and sin(36). 


10. 


11. 


12. 


Derive similar formulas for cos(4@), sin(40), cos(56), and sin(58). 


Draw the following paths: 

(Gi) y(t) =e (t € [0, z]) 

Gi) y(t) = 1+i+2e™ (¢ € [0,27]) 
(iii) y(t) = zo + re~™ (t € [0, 27]), where zo € Candr > 0 
(iv) y(t) =t+icosht ¢ € [—1,1]) 

(v) y(t) = cosht+ isinht 

‘Osborne’s rule’ states that any formula involving sin and cos has an analogous 
formula involving sinh and cosh, which is the same in every way except that the 
product of two sines must be replaced by minus the product of two hyperbolic sines. 
For each of the following formulas, write down the corresponding formula using 
Osborne’s rule and verify it from first principles: 

(i) sin2A +cosA = 1 

(ii) sin(A — B) = sinAcosB-— cosA sinB 

(iii) cos(A + B) = cosA cos B— sinA sinB 

Comment on Osborne’s rule in the light of the formulas 


cos iz = cosh z siniz = isinhz 


Show that the complex conjugate of cos z is cos Z and that of sin zis sin z. Verify the 
identities 


| sinz|* = 5(cosh 2y — cos 2x) = sinh? y + sin’ x = cosh’ y — cos” x 


2 x = cosh’ y — sin? x 


| cos z/? = 1 (cosh 2y + cos 2x) = sinh? y + cos 


Show that | cos Z|" + | sin z|? = 1 if and only if zis real, and that cos z is unbounded 
on C (that is, no K > 0 exists such that | cos z| < K for all z € C). This contrasts 
with the bound | cos x| < 1 for all real x. 


Derive formulas for the real and imaginary parts of the following functions of z, and 
check directly that they satisfy the Cauchy—Riemann Equations: 
(i) expz 
(ii) sinz 
(iii) cosz 
(iv) sinh z 
(v) coshz 
Derive formulas for the real and imaginary parts of the following functions of z, 
specifying the largest domain on which they are defined, and check directly that 
they satisfy the Cauchy—Riemann Equations: 
(i) tanz 
(ii) tanh z 
(iii) cosec z 
(iv) cosech z 
(v) cotz 
(vi) cothz 


13. 


14. 


15. 


16. 


17. 
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Write tanh(x + iy) in real and imaginary parts and show that tanh(x + iy) is real if 
and only if y = n7/2 forn € Z. 


For each of the functions exp, cos, sin, tan, cosh, sinh, tanh, find the set of points on 
which it assumes: 
(i) real values; and 
(ii) purely imaginary values. 
By considering the real and imaginary parts of 
1— zntl 
I+ ztete te = —-_— 
1—z 
find explicit formulas for the sums: 
(i) 1 +cosx+cos2x+---+cosnx 
(ii) sinx + sin2x-+---+ sinnx 
By similar methods, find: 


(iii) cosx + cos 3x + --- + cos(2n — 1)x 
(iv) sinx + sin3x+---+ sin(2n — 1)x 
(v) sinx — sin2x+---+(—1)" sinnx 
(vi) cos@ + cos(@ + @) +---+cos(0 +nd) 
(vii) sin@ + sin(@ + @)+---+ sin(@ + nd) 
If z(t) = x(t) + iy( is a solution of the differential equation 
d?z iwt 
show that x = x(t) = re z(f) is a solution of 
OF pe iutosnt 5.33) 
ae + AX = ko COS@ (S. 


By finding solutions of (5.32) of the form z(t) = ke!", find a solution of (5.33). 
If ko is complex, say ky = kye“ (k,€ € R), and A, w are real, write down the real 
part of (5.32) to obtain 
d?z 
a2 + Az = ko cos(wt + €) (5.34) 


Show that there is a solution of (5.34) of the form 


ky 
A — w?2 


x(t) = 


cos(wt + €) 


By using the sum of the geometric progression 


5 J —zrtl 
Lehehe  Mea eg SS 
l-z 
find A,,(x) such that 
1 
exe See 44,0) 


= 1 
1+ x? 
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18. 


Verify that the derivative of tan~! x is 1/(1 + x”) where tan! : R > (—2/2, 1/2) 
is the inverse function of tan : (—72/2, 2/2) — R. Hence deduce that 


t 


t 
tates f e =P /3+P /§—( yet yansiy+ f An(x)dx 
Gg: bbe 0 


By estimating the size of the second integral, show that the power series 


fo Sas ey One 


converges to tanf¢ for |t| < 1. 
Deduce Gregory’s series 


=1-24 bee 
nl ee) 


Gregory’s series converges very slowly. Classically, better methods for calculating 


zt were obtained using the related series 


= tan”! I Aan i 
— 2 


3 


ALA ALN 


1 
5 239 
Verify these formulas using the addition formula for tan(@ + @). Use the second to 
calculate 2 correct to 5 decimal places. (This requires only one term of tan~! a5 
and five terms of tan~! .) 


1 
= 4tan! = + tan7! 


Series that converge far more rapidly to z are now known. You can find them 
easily on the Internet. 


Integration 


The next part of the grand plan is to define complex integration by analogy with the real 
case, and establish the inverse relation between differentiation and integration. 
Consider a complex function f : D — C on a domain D, and let ~,z, € D. The 
real integral [| i f(t)dt does not generalise immediately to the complex integral he F(z)dz 
because this expression does not specify how z goes from Zp to z1. To do that, we must 
specify a path y between them. A sensible notation for the integral is then i f (2)dz, or 


i f for short. 


In real analysis the value of a definite integral [ i f(t)dt depends only on the limits 
a,b. Complex analysis is more complicated, and the integral along a path may depend 
on the path as well as on its end points. We give a simple example in Section 6.10 as 
soon as we have introduced the necessary concepts and techniques. 

There are two approaches to defining - f. The first is to build up the theory of com- 
plex Riemann sums by mimicking the real case. We do this in Sections 6.1 and 6.2. This 
approach applies to any continuous function, provided it is integrated along a continu- 
ous path y : [a,b] — C. This is one reason why we built the condition of continuity 
into the definition of a path in Chapter 2, by requiring y to be a continuous map. 

In the special case where the function y is continuously differentiable, with derivative 
y’, we derive an explicit formula for the integral: 


b 
jo (fy @)y de (6.1) 
y a 


Here zo = y(a), 1 = y(b). This formula makes sense only when y’ is defined and the 
expression concerned is Riemann integrable; the requirement that y should be continu- 
ously differentiable is the simplest way to ensure this. To simplify terminology, such a 
path is said to be smooth. 

A key result, the Estimation Lemma 6.41, bounds the absolute value of such an inte- 
gral by the product of the supremum of | /f(z)| on y and the length of the path y. It 
turns out that ‘length’ need not be a meaningful concept for a continuous path — another 
somewhat counterintuitive fact — but it is well-behaved for smooth paths. In Section 6.3 
we define ‘length’ and show that for any smooth path it is given by the simple formula 


b 
Le i ly"(Oldt (6.2) 


Notice that the derivative of y features explicitly in this formula, which is one reason 
for assuming smoothness. 
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These equations lead to a second approach, in which the reader is assumed to be 
familiar with real integration. Why not restrict the theory to smooth paths (or, more 
generally, piecewise smooth paths — that is, finite sums of smooth paths — which pose 
no new problems)? Then we can use (6.1) and (6.2) to define Ty f and L. Admittedly, 
the integrand (fy(t))y’( in (6.1) is complex, but we can reduce the formula to real 
integrals by writing it as U(t) + iV() and defining 


b b 
- f= ) U(t)dt +i / V(idt 
y a a 


We adopt this method in Section 6.8 as an alternative route to complex integration, 
allowing Sections 6.1—6.3 to be omitted. This short cut has a price. It means that a 
couple of proofs later in the chapter must be given in a more technical and less intuitive 
manner. But this price is not very great, and it leaves the reader with a genuine choice: 
work through the theory of complex Riemann integration to experience the full analogy 
with the real case, using continuous paths, or bypass the next three sections and start at 
Section 6.4, restricting to smooth paths. 


The Real Case 


For the reader who has chosen to build up the analogy between the real and complex 
integrals, we begin by recalling the real case. 
The Riemann integral e o(t)dt of a real function ¢ : [a,b] — Ris defined in stages. 
First, subdivide the interval [a, b] to obtain a partition P of [a, b] given bya = ft < t) < 
- < t, = b, and choose intermediate points s; in each subinterval t;_| < s; <t,. Then 
form the Riemann sum 


S(P.6) = D> b(s, Mt — t-1) 
r=1 


The points fo, f),...,t, are called the division points of P. Another partition Q is said to 
be finer than P if the division points of P are all included in those of Q. 
The following result is well known from real analysis; we quote it without proof. 


LEMMA 6.1. Let @ : [a, b] > R be continuous. Then there exists a real number A such 
that for any ¢ > 0 there is a partition P, of [a,b] such that, for every partition P finer 
than P,, we have 


|S(P,o) —Al <€ 


This real number A is denoted by i o(t)dt, and is the Riemann integral of @ from a 
to b. 
The actual computation of ri o(t)dt is usually performed by antidifferentation: 


THEOREM 6.2 (Fundamental Theorem of Calculus). (i) [fa real function @ is contin- 
uous on [a,b] and F’ = @, then 


b 
/ $(t)dt = F(b) — F(a) 
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(ii) If @ is continuous on (a, b| and 


I(x) = [ ecoar (x € [a, b]) 


then I' = @. 


Example 6.3. To compute [| i dt we do not need to calculate the sums S(P,#) where 
g(t) = t. Since F (j= - /6 satisfies F’ = @, Theorem 6.2(i) immediately gives 


b 
5a te. Pe 


More generally, we can compute i o(t)dt by seeking an antiderivative F and 
appealing to Theorem 6.2(i). 

Before passing to the complex case, it is helpful to consider a slight generalisation: 
the Riemann-Stieltjes integral , (t)dé, where 6 is a second real function. Here, given 
a partition P of [a, b] as above, we form the sum 


S(P,b, 0) = D> b(sr)O(tr) — O(tr-1)) 


r=1 


From real analysis we have a generalisation of Lemma 6.1: 


LEMMA 6.4. Let ¢,0 be real functions defined on [a,b], such that $ is continuous and 
9 has continuous derivative 0’. Then the real number B = im o(t)0’ (t)dt satisfies the 
following condition: 

Given any € > 0 there is a partition P, of (a, b] such that, for every partition P finer 
than P,, we have 


|S(P, , 0) — B] < € 


The limit B of the sum S(P,@,@) is also denoted by f . o(t)dé. It is the Riemann— 
Stieltjes integral of @ from a to b with respect to 0. Lemma 6.4 tells us that when @ is 
differentiable, the Riemann—Stieltjes integral he (t)d0 is equal to the Riemann integral 
f 4 $(£)'(t)dt. The conditions of the lemma ensure that the integrand $(r)0’(f) exists and 
is continuous. 

The Riemann and Riemann-Stieltjes integrals exist under far more general conditions 
on @,0 than those we have mentioned, and the reduction from the Riemann-Stieltjes 
integral to the Riemann integral is then not always valid. The conditions in Lemma 6.4 
are all that we require in this book. 


Complex Integration Along a Smooth Path 
Riemann sums define complex integrals for continuous functions and paths, but we can 


derive a simple formula for the integral, which often lets us compute it, if we work with 
a more special class of paths: those with continuous derivatives. 
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DEFINITION 6.5. A path y : [a,b] — C is smooth if y’ exists and is continuous 
throughout all of [a, b]. 


This means that if y(#) = x(t) + iy(), where x and y are real, then x’ and y’ exist and 
are continuous on the whole of [a, b], including one-sided derivatives at the end points. 

The definition is intended to formalise the physical sense of drawing a smooth path 
with a finger from the starting point to the final point. Invoking dynamical imagery, 
the path y describes a point particle moving in the plane, whose position at any time 
t € [a,b] is y(t). Smoothness ensures that the point y(t) moves with a specific velocity 
at all times t, namely y’(f), and that this velocity changes continuously. Our intuition 
here is not just geometric: it is dynamic. 

Recall that velocity is usually described as ‘speed and direction’, where speed is the 
magnitude of the velocity and direction is determined by the time-derivative of posi- 
tion. The speed is |y’(¢)|, a non-negative real number. When y’(t) 4 0, the coordinates 
(x'(t), y(t) of |y’(t)| determine a non-zero vector in the plane, which does indeed point 
in a specific direction. What is often ignored is the caveat that when y’(t) = 0 this 
becomes the zero vector, which does not point in a specific direction. This distinction is 
important both geometrically and dynamically, and we discuss it further in Section 6.7. 

We now consider integration of a complex function along a smooth path in C. In 
this case, the Riemann integral of a complex function f can be approached by analogy 
with the limit of the sum )° $(s;)(t- — t--1). We simply consider )> f(¢)(zr — Zr—1), 
where ¢;,z, are complex, and select ¢, and z, along a path y in the domain of f as in 
Figure 6.1. 

For this purpose we assume: 


(i) f :D — Cis continuous, where D is a domain; and 
(ii) y : [a,b] ~ Dis smooth. 


For any partition P of [a,b], given by a = to < t) <--- < t, = D, and intermediate 
points t,_; <s, < t,, form the sum 


SP.F.7) = DOP) — v(G-V) 


r=1 


Figure 6.1 A partition of a path in C. 
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Writing z, = y(t,),¢, = y(s,), this Riemann-—Stieltjes sum becomes 


n 
SIPL.Y) = Dif Ger — 2-1) 
r=1 
which exhibits the direct analogy with the real case. 
As in the real case, we rarely calculate an integral using this summation process. One 
method is to reduce the calculation to two real integrals by taking real and imaginary 
parts. If w : [a,b] > R, let 


yO = UtH)+iV(it) (té [a, 5) 


b b 
/ v()dt = : U(t)dt + i i V(odt 
¥ a a 


With this convention we derive a complex version of Lemma 6.4: 


and define 


THEOREM 6.6. For a continuous complex function f defined on a domain D, and a 
smooth path y : [a,b] > D, 


b 
fi = i f(y(@))y' dt (6.3) 
y a 
Proof. Let y(t) = x(t) + iy(O (¢t € [a,b]) and f(z) = u(x, y) + wa, y) (Zz = x +1y € D). 


Denoting u(x(t), y@), vot), y@)), xO, Y@ by u,v, x’, y’ for short, our convention for 
complex integrals gives 


b b 
[ sow 'oa= f (u + iv)! + iy’)de 


b 
= / (ux’ — vy’) + i(vx’ + uy’)dt 
a 


b b b b 
= / ux dt — / vy'dt + if vx! dt + if uy dt 
a a a a 


If we write f(v(s;)) = uy + 1v; and y(t) = x, + 1y,, then 


SCP.F.Y) = Your + iv Ge + ive) — Ort + ivr] 


r=1 


n n 
= ye = Xy-1) — Ss Vr Yr _ Yr-1) 
r=1 r=1 


n n 
+1 > VO — Xp-1) +1 ee Ur(Yr — Yr—1) 
r=1 r=1 
We now match the four integrals and four sums in pairs, and use Lemma 6.4. For 


instance, 
n n 


Yo ure = Xr—1) = Yo ul(sy), y(Sr))e(t-) = x(t-—1)) 


r=1 r=1 
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where both #(f) = u(x(t), y(t)) and x(t) are continuous on [a, b]. So given € > 0, we can 
find a partition P(€) such that for any partition P that is finer than P;(€) we have 


n 


bo | . 
Youre, = ara) = | ux dt| < a 


=i A | 


Similarly we find partitions P2(€), P3(€), P4(€) such that all finer partitions P give sim- 
ilar inequalities between corresponding pairs of integrals and sums. Taking P, to have 
division points all those of P)(€), P2(e), P3(€), P4(e), then whenever P is finer than P, 
all four inequalities hold. Therefore 


b 
sP.fiy)— | fr’ oat 


€ € 
<aras fF —=€ 
4 4 4 


Hence 


b 
; f= / f(y) y' dt 
y a 


Example 6.7. f(z) = 2, y(t) = 0? +it (t € [0, 1]). See Figure 6.2. 


1 
[r= [ soor'on 
y 0 


1 
= / (7 + it) (2t + idt 
0 


1 
= / ( +0ir = FOr id 
0 


1 1 
= i (2° — 42)dt +i i Gt =P )di 
0 0 
= [38 — 45 + iP — 3° 


_ 2,2 
=—3 Tt 3l 


Figure 6.2 Path for Example 6.7. 
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The Length of a Path 


The length of a path is defined in terms of approximating polygons, as follows. 


DEFINITION 6.8. Given an arbitrary path y : [a,b] — C, take a partition P of [a,b] 
given by a= fo < t) <--- <t, = Db, and calculate the length 


Liye) = oly) — v(@—-1)| 


r=1 
of the approximating polygonal curve yp with vertices y(to), y(t1),..-,¥(tn), Fig- 
ure 6.3. 


Figure 6.3 Length of a path as the supremum of the lengths of approximating polygons. 


The length L(y) of y is the supremum (least upper bound) of the lengths L(yp) over 


all such approximating polygons. 


The length of an arbitrary path need not be finite: 


Example 6.9. Suppose y(s) (t € [0, 1]) is defined as follows. Let m,, be the midpoint of 
[1/™+ 1), 1/n], somy = (2n+ 1)/(2n(n+ 1)). Introduce a function 4 where the graph 
of y = A(A consists of two straight line segments from (<7 0) to (mn, 1) and then to 
CG 0). Define 


_—jJt+inr@) O<t<1) 
yo={ 0 (¢ = 0) 


The image of y is drawn in Figure 6.4. 


Figure 6.4 A path with infinite length. 
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The length of this path from t = at tot = , exceeds 2, Let P be the partition 


O0< at < my < B <ctt< x < m, < 1. Then the length of the polygonal path 
exceeds 


2 2 2 1 1 1 
pa =2{14 fess 
n n—-1l 1 2 3 n 


The latter is twice the harmonic series, so it increases without limit as n increases. There- 
fore L(y) is infinite. 


In this example, the path is not smooth. The next example shows that even if the path 
is ‘nearly smooth’, its length can still be infinite. 
Example 6.10. Consider the path 


Bie t+itsin(z/t) (O<t< 1) 
v -| 0 (t = 0) 


as in Figure 6.5. 


Figure 6.5 A nearly smooth path with infinite length. 


A calculation shows that although 
y(t) = 1+ i(sin(/1) + tcos(s/t))(—z /t) 


is continuous for 0 < t < 1, the limit of 


y(t) — yO) 
t 


= 1+ isin(z/t) 


as t tends down to 0 does not exist. Hence y’ is not continuous on the closed interval 
[0, 1], so is not smooth on [0, 1] (though it is smooth on any subinterval [k, 1] with 
O0<k<1). 

We claim this path has infinite length. Let P, be the partition 


0<I/n<1/m—5)<I1fM—-1) <---< 1/2<1/2-5)<1 
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Since 


and 


1 1 sin(n — 5) 
Y i : 1 


1 iy 
1 1 
nh n—~ zy 


Ne 


the distance from y(1/n) to yU/(n — 5) exceeds 1/(n — ahi which exceeds 1/n. We 
need only compute the lengths of alternate segments of the polygonal approximation y, 
to y given by P, to find that 


1 1 
L(y) > -4 fr» ] 
n n— 


which again increases without limit. 


Integral Formula for the Length of Smooth Paths and Contours 


When a path is smooth, its length is finite, and can be calculated by an integral: 


PROPOSITION 6.11. The length of a smooth path y : [a,b] > Cis 


b 
Ly) = i ly’ (o)|de (6.4) 
and this is finite. 


We prove this result below, but first we note that the integrand |y’(t)| is continuous on 


[a, b], so the real integral 
b 
L= ly“()|de 
a 


certainly exists, and is finite. We must show that L is the supremum of the lengths L(yp) 
of approximating polygons yp. 
Now L can be closely approximated by sums 


SP, b) = Yo lv" (rte = tr) 


r=1 


where P is the partition a = {9 < t) <-+-- <%=), t-1 < 5; <t,, and d(f) = |y(0). 


120 Integration 


The proof of Proposition 6.11 is greatly facilitated by: 


LEMMA 6.12. With the preceding notation, given any € > O, there exists a partition 
Q, such that for any partition P finer than Q,, the length L(yp) of the approximating 
polygon corresponding to P satisfies 

IS(P, @) — L(yp)| < € 
Proof. By definition, 


SP,6) = Yo lv’ )G — 1) 
r=1 


Liye) = Yo lv tr) — yG-)I 


r=1 
Writing y(t) = x(t) + 1,(¢), the Mean Value Theorem in real analysis gives 
X(tr) — X(tr—-1) = x! (oy) (tr — tr-1) for some oy € (tr—1, ty) 
y(t;) — y(t—1) aa y (et, 7 tr-1) for some TE (f-1; t,) 
so 
v(ty) — VG-1) = (x'(o;) + iy'(t,) (ty — tr-1) 
Now x’, y are continuous on [a, b], and from real analysis they are uniformly continuous 
since [a, b] is a closed interval. This means that for any ¢ > 0 there exists 6 > 0 such 
that 
s,t€ [a,b] and |s—t| <6 implies 
k(s) — xD] < e/2°(b-a) and |y(s) —y'@| < €/2(b —a) 
(where the same 6 works for all s, f). 

If we choose Q, to be any partition such that each subinterval [t;—1, t;] has length less 
than 6, then any partition P finer than Q, has the same property. Because s,,0,,T, € 
[t--1,t,], they are all within distance 6 of each other, so 

ly GAG -— t—-) — vat) -— rG@—vil 
= |l"(sr)| — xr) + iy’ GI ( — t-1) 
= |(x'(sr) a iy'(sr)) = (x'(or) an iy (tr))| (t — tr-1) 
< (Ix'(s,) — xr) + by G) — YG )DG — FD 


< ( : Mtr ty—1) 
2b-—a 2b-a 


= e(t, — t_-1)/(b — a) 


where in the third line we use property (1.12) of the modulus. 
Thus 


[S(P,) — Lye) < oly IG tr1)— |v (fr) — v1) | 


r=1 


n 
E 
< ) hoy ee 
r=! 
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n 
gE 
= YG tr-1) 
b-—a a 
=E 


as required. 


We are now prepared for the following: 


Proof of Proposition 6.11. Let L(y) be as in (6.4). By the definition of the Riemann 
integral, there exists a partition P such that for any partition P finer than P,, 


ISP,6)-—Ll <€ (6.5) 


Adding more division points to P, as necessary to obtain a finer partition Q, with each 
subinterval of length less than 6, then for P finer than Q,, Lemma 6.12 gives 


IS(P, @) — L(yp)| < € (6.6) 
Combining this with (6.5) gives 
L—2e < L(yp) < L4+2e (6.7) 
Thus for any positive k we can find an approximating polygon yp with 
L—k <L(yp) (6.8) 


Given any partition Q of [a,b] with approximating polygon x, the addition of further 
vertices to x can only increase its length, by the triangle inequality. Thus if P is finer 
than Q then L(x) < L(yp). We choose P finer than Q, so that (6.7) holds; then 


L(x) < L(yp) < L+ 2 
But € is any positive number, so for any approximating polygon x, 
Lik) <L (6.9) 


The inequalities (6.8) and (6.9) exhibit L as the supremum of the lengths of all 
approximating polygons, completing the proof. 


Example 6.13. The standard straight line path y = [z1, z2] 
y@=2ud—-h+2z2t (te [0,1)) 
has length 


1 1 
Ly) = f Wola = | yd al 
0 0 


as expected. 


Example 6.14. The circle S(t) = zo + rel! (t € [0,27]), centre zo and radius r > 0, has 


length 
2n : 2n 
LS) = / lire” =| rdt = 2nr 
0 0 
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If You Took the Short Cut... 


Some readers will have read the previous three sections, and some will not. For the latter, 
we now give some basic definitions that are motivated by the previous three sections: 


DEFINITION 6.15. A path y : [a,b] — Cis smooth if y’ exists and is continuous on 
the closed interval [a, b]. 
If D is a domain, the integral of a continuous function f : D > C along the path y is 


b 
/ f= / f(y @)y' at (6.10) 
y a 
The length of y is 


b 
Ly) = f ly’@ldt (6.11) 


A comment on (6.10) and (6.11) is in order to explain what they mean. The integrands 
f(v@)yv"@ and | y’/()| are both continuous. The latter is real; the former may be written 
in real and imaginary parts as f(y(‘)) y(t) = U(t) + iV(O and the integral y f can then 
be calculated using two real integrals: 


b b 
i f= / U(dt +i / V(idt 
y a a 


Since all integrands involved are continuous, the real integrals all exist. 

At this stage, readers who made either choice are now equipped to proceed to the rest 
of the book. If you took the short cut, take a quick look at Examples 6.13 and 6.14 to 
convince yourself that the formula defining length makes sense for lines and circles. 


Further Properties of Lengths 


Recall from Definition 2.27 that paths can be added together if their start and end points 
match up correctly. Using approximating polygons, it is easy to prove that the length 
function L is additive: 


PROPOSITION 6.16. [fy =y1 +---+ yn, and L(y,) exists for 1 <r <n, then 


Ky) = L(y) +++: + Lyn) 


It is important to understand that the length of a path need not be the same as the 


length of its image curve. In particular, this happens whenever the path ‘doubles back 
on itself’, as in the following example. 


Example 6.17. Let o : [—2,2] — C be any smooth path. Let p : [—2,2] > [—2, 2] 
be given by p(t) = P — 3t, which is a smooth map, and let y = o 0 p. Then it can 
be shown, using (6.4), that L(y) = 3L(o). The geometric significance of this result is 
straightforward: as ¢ increases from —2 to 2, the cubic #3 — 3¢ increases from —2 to 2, 
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then decreases from 2 to —2, and finally increases from —2 to 2 again. So the path traces 


the curve three times. 


Next, we consider how a change of parameter affects the length of a smooth path. 


DEFINITION 6.18. Smooth paths y : [a,b] > C, A: [c,d] — C with the same image 
curve C are smoothly equivalent parametrisations of C if there is a smooth function 
p : [a,b] — [c,d] with non-zero derivative p'(t) ~ 0 for t € [a,b], where p(a) = c, 
p(b) =d,and y =Aop. 


! exists, is smooth, and 


This relationship is an equivalence relation, because p~ 
also has non-zero derivative, by the inverse function theorem. A smoothly equivalent 
parametrisation can be imagined as tracing the same curve in the same direction but at 


different speeds. 


PROPOSITION 6.19. If two smooth paths y : [a,b] > C, a: [c,d] > C are 
smoothly equivalent parametrisations of the same curve C, then they have the same 
length: L(y) = L(A). 


Proof. We have y = 40 where p : [a,b] > [c,d] is a strictly increasing real function 
with continuous derivative p’ on [a,b]. Therefore p(t) > 0, so p’(t) = |p’(d|. Let 
s = p(t). Then as ft increases from a to b, s increases from c to d and we can substitute 
ds = p’(t)dt in the integral. The length of the path y is therefore 


b 
Ly) = f y' (| dt 
er 
=| (Lo py (de 
ie 
= / Ql(e(py|le'Oldt 


b 
é / (aX(o(p)|o' (dt 


b 
= / u(s)\ds = LA) 


Lengths of More General Paths 


You may be wondering why we require y to be smooth in the definition of length in 
Definition 6.15? Proposition 6.16 applies to contours made from several smooth pieces, 
which need not fit together smoothly where they join, so paths that are not smooth 
can have meaningful lengths. Obviously, Definition 6.15 cannot be used for continuous 
paths, because it involves the derivative of y. But the definition ‘supremum of lengths 
of all approximating polygons’, which occurs in the alternative treatment of Section 6.3, 
makes sense for any path, without assuming smoothness. However, this definition has its 
own awkward features. Sometimes it goes wrong in the mild sense that the entire path 
has infinite length, as in Examples 6.9 and 6.10. However, it gets much worse. Using the 
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‘approximating polygon’ definition for length, there are continuous paths y : [a,b] > C 
with the disturbing property that the length of any segment of the path, between points 
c < din [a, D], is infinite. 

In fact, we have already met just such a path: the graph of the blancmange function, 
Figure 4.3. Using dyadic rationals, it is straightforward to construct a sequence of par- 
titions of [c,d] such that the corresponding polygons have lengths tending to infinity. 
In fact, it is enough to do this for the graph of the blancmange function on the inter- 
val [0, 1], because we have already pointed out that this graph contains arbitrarily small 
copies of the graph of the blancmange function, and a small number times infinity is 
still infinity. These copies are usually distorted by an affine transformation, but such a 
distortion keeps the length infinite. Similar remarks apply to space-filling curves, and to 
standard fractals such as the snowflake curve, Mandelbrot [13], and Falconer [4]. 

In other words, a path that is continuous but not smooth may not have a meaningful 
length. This is probably counterintuitive, because we have been trained from an early 
age to assume that every linear object does have a length. The ancient Greeks worried 
about the length of the circumference of a circle without ever defining what they were 
worrying about. It seemed obvious to them that any arc of a circle must have a length; 
their main aim was to find out what that length is. To do so, they tacitly assumed several 
plausible properties, such as finding the length by using polygons to approximate the cir- 
cumference. Eventually mathematicians tightened up the logic, leading to Definition 6.8 
above, and understood what can go wrong. 

To complete the story, smoothness is not actually necessary for a path or curve to have 
a well-defined length. There is a more general notion of a rectifiable curve, for which the 
‘approximating polygon’ definition is entirely satisfactory. However, the smooth case is 
all we need in this book. 


Regular Paths and Curves 


Just as we distinguish between a path y and its image curve, we must distinguish 
between the derivative y’(t) and a tangent line to the curve. The derivative can be inter- 
preted as the velocity vector at time ¢ for a point y(t) moving along the curve. If y'(t) 4 0 
it defines a tangent direction, hence a tangent line to the curve. When y’(t) = 0 it does 
not defines a tangent direction, so the curve may not have a tangent line. Section 6.7 
shows some of the things that can then happen. 

First, some standard terminology: 


DEFINITION 6.20. Let y : [a,b] > C be a smooth path. 
If to € [a,b] and y’(to) 4 0, then fg is a regular point of y. 


If to € [a,b] and y’(to) = 0, then #9 is a singular point of y. 


When the image curve has a well-defined tangent line, it looks smooth: see Proposi- 
tion 6.22 below. 

The above discussion leads naturally to a special type of path or curve that will be 
useful as we proceed, to relate the abstract theory to geometric intuition: 
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Figure 6.6 Tangents to a smooth path. 


DEFINITION 6.21. A regular path is a smooth path y : [a,c] > C such that y’() 40 
for all t € [a, b]. 
That is, every point on the path is a regular point. 


A regular curve is the image of a regular path. 


If y is regular, then by Proposition 4.18 a point on the tangent at y(4) is of the form 
y(t) + hy’ for any h € R, Figure 6.6. 

The standard paths L(f) (line) and C(¢) (circle) in Section 2.4 are regular. 

In Figure 6.6 the tangent line at y(t) is a good approximation to the curve given by the 
image of y, near that point. To formalise this idea, we compare the path y(t) for ¢ near 
some point fo € [a,b] with the corresponding tangent line. We can think of the tangent 
line as a path T in its own right, defined by 


t(to +h) = y(to) +hy'(t) (he R) (6.12) 
and compare it with 
V(to +h) (tnear fo) 


We now show that when / is small, these two paths, treating / as a parameter, are very 
close together for any given choice of h: 


PROPOSITION 6.22. Let y : [a,b] — C be a regular path. As h — 0, the expression 


1 
7 LY “to th) — t(to + h)] 
tends to 0. 


Before giving the simple proof, we interpret this result: it says that for sufficiently 
small h, the difference between the path y and the tangent path t tends to zero faster 
than h does. 


Proof. Since y is differentiable at fo, 


_ y(to +h) — y(to) 
lim 
h->0 h 


= y'(to) 
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Rewrite this as 


1 / 
fim 7 [yo +h) — (yo) + hy Go))] = 0 


By (6.12) this is the same as 


. 1 
lim —[y(to + h) — T(t +h)]=0 
h>oh 

The proof does not use y'(fo) 4 0; that is, the limit is still zero even when there is a 
singularity at to. However, in that case t(t9 + h) = y(to) for all € R. That is, t does 
not define a tangent line. So the result tells us nothing about the shape of the image of y 
near fo. 


Parametrisation by Arc Length 


Proposition 6.22 is a formal statement of the intuitive idea that a regular curve looks 
smooth near any point. It has a continuously turning tangent and a well-defined finite 
length. These properties are inherited by subpaths, leading to: 


DEFINITION 6.23. Let y : [a,b] — C be a regular path, with image curve C. Let 
to,t, € [a,b] with t < ft), and let y(t9) = c, y(t) = d. Then the arc length Lc(c, d) 
from c to d in C is the length of y|j1,,4,]; that is, 


t 
Lc(c,d) = / ly’(t)| de 
to 


We now prove that a regular curve can be smoothly reparametrised so that the param- 
eter ¢ is arc length, or a constant multiple of arc length if that is more convenient. Let 
the length of y be L. Define A : [a,b] — [0, L] by 


Ms) = Le(a, 8) = i ly’ @lat 


Then 

VW) = ly'O| AO 
so A is a strictly increasing function on [a, b] with a continuous derivative 4’ on [a,b], 
where A(a) = 0, A(b) = L. It is regular since i’(f) # O. It therefore has a strictly 


increasing inverse function 9 = A~!. Now pe : [0,L] — [a,b] and has continuous 
derivative 


p') = 1/0 £0 


for a < t < b, so pis also regular. 
The path 


t=yop:[0,L]>~C 
is regular, and 


TA) =yopokt)= y(t) 
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so T is areparametrisation of y, with parameter A(‘) = Lc(a,t). So t reparametrises the 
image curve C of y by arc length. Moreover, 


Ie’) = ly Olle’ Ol = lV Ol/ Ol = lv'Ol/lv'@l = 1 


So as t varies, t(#) moves along the path with unit speed. 

Multiplying the parameter by a non-zero constant scales the speed 1 to any non-zero 
constant speed, corresponding to a parameter that is a constant multiple of arc length. 
Any curve parametrised by arc length or such a multiple is regular. 


Regular and Singular Points 


Before tackling the intricacies of contour integration, we explain why the cases y’(t) 4 0 
and y’(t) = 0 differ significantly. 

Near a regular point, the image of y is a smooth curve in the geometric sense that it 
has a well-defined tangent direction (and this varies continuously). The curve may cross 
itself, but each separate segment near the crossing looks smooth. 

Near a singular point, this may not be true. Sometimes there is a sensible, indeed 
visible, tangent direction, but sometimes there is not. The geometry of y(t) when t¢ is 
near a singular point fo is highly sensitive to the precise behaviour of y(t) when f is near 
to. We give a series of simple examples to illustrate some of the possibilities. 


Figure 6.7 Examples of possible behaviour of a smooth path near a singular point. Left: Cusp. 
Right: Right-angled corner. 


Example 6.24. Suppose that y : [—1,1] > C where y(t) = 7 + it’. The derivative is 
y(t) = 2t+ 3it*, which is non-zero except at t = 0. The image is a semicubical parabola 
with a cusp at the origin, Figure 6.7 (left). Everywhere else, the image is a smooth curve. 
The image has a natural tangent /ine at the cusp point, along the real axis, because this is 
the limiting direction of tangents near the origin. However, the velocity vector reverses 
direction as it passes through the origin. 
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To make sense of what is happening at a singular point we compare it with the real 
case. In real analysis the graph of a function f : R — Rconsists of the points (x, f(x)) € 
R?. If f is smooth, the graph has a well-defined tangent — even at a critical point where 
f'(x) = 0. However, if we imagine the point y = f(x) moving as x increases steadily, y 
becomes stationary at any critical point. (Indeed, another term is stationary point.) The 
tangent at a critical point is horizontal. This is the same as the direction in which a point 
(x,f(x)) on the graph is projected to obtain the image f(x). So the tangent projects to a 
single point. 

The same kind of behaviour is happening at the cusp, but now y is a hybrid function 
with a graph y : [—1, 1] — C in coordinate form (, ? +i?) € Rx C. This is drawn in 
the upper left part of Figure 6.8, represented in three-dimensional real space as (¢, x, y) = 
(t, 7,3), where t,x, y all lie between —1 and +1. Projection onto the vertical (x, y)-plane 


3D (t, x, y)-space y=t project onto 


vyereeeVv~—/ > 
vertical (x, y)-plane 


a 


(x, y)-plane 


project § down 
onto (¢, x)-} plane 


(t, x)-plane t, y)-plane 


Figure 6.8 Example 6.24 represented in terms of the graph of the graph of the function 
y@) =2+if inR x C pictured in R3. 
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reveals the semicubical parabola (¢?, 3); projection down onto the (t, x)-plane gives the 
parabola (t,x); and sideways projection onto the (¢, y)-plane gives the cubic curve (t,x). 
The tangent at the origin is in the direction (1, 24, 317) = (1,0, 0), which points along the 
t-axis perpendicular to the complex plane and its projection onto the complex plane has 
length zero. 


Example 6.25. More surprisingly, the image can have a sharp corner, contrary to the 
intuition that it ought to at least look smooth. 

Define ¢ :R > R by 
0 (t <0) 


60 = | 2 (t>0) 


This function is continuously differentiable because both 0 and 1* have the same 
derivative, mainly 0, when t = 0. 
Now define y : [—1,1] ~ C by 


y(t) = PD) + if(—1) 


The image is then as shown in Figure 6.7 (right). 
The definition of ¢@ can be modified to make y infinitely differentiable, while retaining 
the right-angled corner. Just change ? to e~!/" as in (4.8). 


Next we compare two different cases where the image of y is a spiral. 


Example 6.26. Suppose that 
0 (t= 0) 
t)= : 
yy) tel/t (t> 0) 
Then y : [0,1] — C has a spiral image, Figure 6.9 (left). In this case, y is not 
continuously differentiable. Indeed as t + 0 from above, we have 


y(t) =e? 4 toe = (14) pe 


This goes round and round the circle of radius |1 +i] = V2 at an ever-increasing rate, 
so it does not tend to a limit as tf — 0. The one-sided derivative at t = 0 is not defined, 
let alone continuous. 


Contrast this spiral with the following closely related one: 


Example 6.27. Suppose that 
0 (t = 0) 
t)= ‘ 
y@) el/t (t > 0) 
Then y : [0,1] — Chas a spiral image, Figure 6.9 (right). The spiral is much tighter 
than the previous example, and now y is continuously differentiable. Indeed as t > 0 
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Figure 6.9 Left: A spiral path that is not smooth. Right: A smooth spiral path. 


from above, we have 
y(t) = (2 + ite“ 


which tends to 0 as t — 0. The one-sided derivative at t = 0 is defined, and the derivative 
is continuous there. As it happens, it is zero. 


We now consider the lengths of the spiral paths in Examples 6.26 and 6.27, which can 
be estimated by bounding the lengths of successive turns through 27. In Example 6.26, 
turn 7 has length lying between two constant multiples of 1/n. So the total length lies 
between the same constant multiples of }°,, 1/n, the harmonic series, which is infinite. 
In Example 6.27, turn n has length lying between two constant multiples of 1/n”. Since 
>! /n2 is finite, this spiral has finite length. 

These results are consistent with the lack of smoothness of Example 6.26 (indeed, 
since its length is infinite Proposition 6.11 implies that it cannot be smooth), and the 
smoothness of Example 6.27, which implies that its length must be finite. 


Contour Integration 


For readers who took the Riemann integral route, Section 6.2 defines the notion of a 
smooth path and deduces a formula for the integral along such a path. Those who opted 
for the short cut should refer to Definition 6.15. For all readers, we now generalise the 
notion of integration to allow paths made up of a finite number of smooth pieces: 


DEFINITION 6.28. Using the notation of Section 2.4, a contour is a path of the form 


YEN +n 


where y1, .. ., ¥, are smooth paths such that the final point of y,. coincides with the initial 


point of 4.4; forr =1,...,n—1. 
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Figure 6.10 The contour defined in Example 6.29. 


Example 6.29. Let 


vit) =t (t € [e, R]) 
y2(t) = R(cost + isin ft) (t € [0,7]) 
y3(t) = t (t € [—R, €]) 


ya(t) = e(—cost+isin#) (¢ € [0,7]) 


Then y = y, + v2 + ¥3 + Yq is the (closed) contour drawn in Figure 6.10. 


Definition of Contour Integral 


Integration along a contour is an easy extension of integration along a smooth path: 


DEFINITION 6.30. If D is a domain, f : D > C is continuous, and y = y} +---+ yn 
is a contour (so all y, are smooth), then the contour integral of f along y is 


fof ftot] et 
fie 


Yn 


and 


Livy) = LM) +--+ Ln) 


It is obvious that if a smooth path o is subdivided as 


0 =0,10) 


[r=fore fe 


so further subdivisions of the contours y),..., ¥, in the above definitions will not affect 
the values of the integrals. The contour integrals are therefore well-defined. 
The following standard properties hold, analogous to the real case: 


then 


132 Integration 


PROPOSITION 6.31. 


[osefafy = 
[i +f) = [si - [Pp (6.14) 
fanefs (cE C) (6.15) 


[te -fy (6.16) 


Proof. Equation (6.13) follows trivially from the definitions. The proofs of (6.14) 
and (6.15) depend on whether the reader has read Sections 6.1-6.3. If so, then 
Section 6.2 implies that 


SP.fi +fav) = SPA, v) + SPAY) 
S(P, cf, y) = cS(P.f,y) 
for any partition P, which imply (6.14) and (6.15). 
The reader who started from Section 6.8 can verify these formulas using known 
properties of real integrals, as follows. 


First, it is sufficient to verify them for a single smooth path. To prove (6.14), observe 
that 


b 
/ (f, +4) = / (y(t) + fol Oy" at 
: b b 
= il Ay @)y' (dt + / Aiy@))y' (pdt 
—~fyil ft 


Y1 v2 


using the additivity property for real integrals on the real and imaginary parts. 

The proof of (6.15) is slightly longer (a minor penalty for skipping Sections 6.1-6.3), 
because we must separate everything into real and imaginary parts. Letc = a + if, 
f(y") = UD + iV. Then 


b 
/ of = / (aw +iB)\(U() +iV(N)dt 
y a 
b 
= / (la U(T) — BV] + ila V(t) + BU(D))dt 
b b 
= [aU(r)— pviotar+i f [a V(t) + BU(d)))dt 


b b b b 
=a i U(T)dt — B : V(t)dt + ia / V(tdt + ip i; U(t)dt 
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b 
= (a + if) i (U(t) + iV(t))dt 


=e[s 


using standard properties of real integrals. 
Finally, we prove (6.16). If y : [a,b] — D is a contour, then the opposite path 
—y : [a,b] — D, defined by 


—y@=y(atb—1t) (te [a,b)) 


is also a contour. Now 


? d 
/ i fiviat+b D5 y(a+ b—ddt 
-y a t 


b 
=--f f(y(atb—1t)y'(at+ b—t)dt 


Substituting s = a+ b — t this becomes 
a 
-| F(v(s))y"(s)(—ds) 
b 


b 
_ / FO(S)y'(s)(ds) 


afi 
[re-fy 


The Fundamental Theorem of Contour Integration 


Therefore 


Integrals of complex functions are only occasionally computed by breaking them down 
into real and imaginary parts and calculating two real integrals as in the previous section. 
This technique is sometimes needed, but a far more efficient method is available for [. , f 
if we can find an antiderivative of f. This concept is so important that we give it a formal 
definition: 


DEFINITION 6.32. Let D be a domain. An antiderivative for a function f : D > C is 
a function F : D > C such that F’ =f. 


An antiderivative, if it exists, is unique up to an added constant, for if F’ = G’ = f in 
a domain, then (F — G)’ = 0 so F — Gis constant by Theorem 4.14. If we can find an 
antiderivative, then the integral can be computed immediately using: 
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THEOREM 6.33 (Fundamental Theorem of Contour Integration). If f : D > Cis 
continuous, F : D — C is an antiderivative of f, and y is a contour in D from z to 21, 
then 


[t= Fe) -Feo 
¥ 


Proof. Let w(t) = u(t) + w(t), W(t) = - Ut) +iV(d) for t € [a,b], where u,v, U, V are 
real. Then W’ = w if and only if U’ = u, V’ = v, and then 


b b b 
/ wear = f moar-+i | wt)dt 


= U(b) — Ua) +iV(b) —iV(@) 
= W(b) — W(a) 


Let w(t) = f(v@)y’(O. Since F’ =f, 
w() =F (yy) = WO 
where W(t) = F(y (4). Therefore 
b 
[i = / w(t)dt = W(b) — W(a) 
y a 


= F(y(b)) — F(v(@)) = F@i) — Fo) 


One remarkable feature of this result is that, in these circumstances, the integral does 
not depend on the path y, only on its end points. In contrast, we show in Section 6.10 
that the integral may depend on the path when the conditions of Theorem 6.33 fail to 
hold. 


Example 6.34. If f(z) = 2° and y is any contour from z = 0 to z1 = 1 +i, then 
F(z) = {2} is an antiderivative of f, and 


[ 2e= 4 a= 1d +93 =2 43: 
Y 


This example gives a visibly easier calculation than that performed at the end of 
Section 6.2 for the particular contour y(f) = P + it (t € [0, 1]). 

However, any euphoria we feel over this phenomenon must be tempered with the 
realisation that, unlike the real case where Theorem 6.2 (iii) tells us that a continuous 
function always has an antiderivative, in the complex case there are continuous func- 
tions without antiderivatives. Indeed, as we prove in Chapter 10, any function that is 
differentiable once in a domain is differentiable as many times as we wish. Now, any 
antiderivative F must be differentiable once, so it is differentiable twice. Therefore its 
derivative f = F’ must be differentiable. So no non-differentiable function f has an 
antiderivative. 
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Example 6.35. We know that f(z) = |z|? is differentiable only at the origin, so it is 
useless to search for an antiderivative when integrating this f — for example along the 
path y(t) = t” + it (t € [0, 1]). In this case, we return to the basic formula 


1 
[etac= [+P art ina 
y 0 


1 1 
=| Q + 2°dr+i (f + ?)dt 
0 0 


= [f+ tA) +iffh + FP) 
=?4+ hi 


Fortunately, many functions do have antiderivatives. For instance, by Corollary 4.5, 
any polynomial 


P(Z) =ao tarz+-++ + aynz" 


has antiderivative 


1 n+l 
n+1 Ink 


p(2) =agz+ xayzt+-+-4 


More generally, a power series has an antiderivative everywhere inside its disc of 
convergence: 


THEOREM 6.36. If f(z) = d-y—0 An(z — 20)” converges for |z — zo| < R, then 


CO 
Gn 
FD = —zo)tt! 
(z) a 71% Z0) 


also converges for |z — zo| < R, and F’ =f. 


Proof. It is sufficient to show that F(z) converges for |z — zo| < R, for then we may 
differentiate term by term by Theorem 4.20 to obtain F’ = f. 

We know that the power series pear an(z—z0)” converges absolutely for |z—zo| < R, 
by Theorem 3.19. For each n, either a, = 0 or 


lan(z— Zo)"/(n+1)| _ lz —Zol 


lan(z — zo)" | n+1 
tends to zero as n — ov, so by the comparison test et a (z—zo)"+! converges. 
If f(z) = baa an(z — zo)” has disc of convergence D, then for any contour y in D 
from z to Zz, 
[oe Co 
>> Gn(z2 — zo)" +! = ay(z1 — 20)" +! 
7 n+1 n+1 
Y n=0 n=0 


In particular, for any contour y in D from Zo to z, 


fy _ se an(z — z0)"* 
y n+1 
n=0 
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An Integral that Depends on the Path 


The Fundamental Theorem tells us that if a differentiable function f has an antiderivative 
throughout its domain D, then the integral [ . f is independent of the choice of path. This 
applies to z” for any integer n 4 —1, with antiderivative z’+!/(n + 1) in the domain 
C\{0}. It applies to 1/z*, 1/z?, ... but fails for 1/z, as we now show using formula (6.10). 


Example 6.37. Consider the problem of integrating 1/z between —1 and 1. The path 
along the real axis passes through the origin, where 1 /z is not defined, so we avoid this 
by choosing a semicircular path. 

If the semicircle lies above the real axis (where im z > 0) we can choose 


nt) =e (t € [x, 2r]) 


noting that e~’*” = —1,e~!2” = 1 and the —t makes the path run clockwise. Now 


i Eg [° : /(t) dt 
-—-az = —_. 
n < x vit) a 
20 1 : 
=| —(—ie") dt 
- et 


27 
= / —idt 
ca 


= [-in]Z” 


= —1(27 —7) 


= —-iT 


All very well, but we could equally sensibly choose the semicircle that lies below the 
real axis (where im z < 0). Now 


y2(t) = e” (t € [, 271]) 


noting that e'* — —1,e7 = | and the +1 makes the path run anticlockwise. A very 
similar calculation yields 


/ Lac= f : s(t) dt 
y2 % 7 sa nip”? 
2a 


1; 
HA si ie") dt 


which is slightly different. 
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At first it might appear that we made a mistake — it’s easy to get a sign wrong — but 
the second integral is clearly the complex conjugate of the first, because everything is 
reflected in the real axis. Since neither is real, they cannot be equal. 

This example does not contradict Theorem 6.33 because 1/z does not have an 
antiderivative in C\{0}. The obvious candidate is the logarithm, but by Section 5.7 
exp is periodic over C hence not one-one. Therefore any inverse ‘function’ to exp 
must be multivalued, as explained in detail in Section 7.3. So the logarithm is not an 
antiderivative over the whole of the domain. 

In Chapter 8 we show that we get a different answer here because the point z = 0, 
where 1/z is undefined, lies inside the region bounded by y, and j. This behaviour is 
closely connected to the multivalued nature of the complex logarithm. It has surprisingly 
profound implications, explored in Chapters 8-12. 

At any rate, this example should counteract any feeling that complex integrals are 
always independent of the path, Theorem 6.33 notwithstanding. 


The Gamma Function 


So far the special complex functions that we have encountered are all extensions to C of 
familiar real functions: polynomials, powers, trigonometric functions, the exponential 
and the logarithm. In this section we briefly discuss a less familiar function, the gamma 
function 1'(z). This also began as a real function and was later extended to the complex 
case. It has important applications, but we restrict attention to defining the function and 
establishing a few basic properties. It will play a key role in Chapter 17 when we sketch 
connections between complex analysis and number theory, culminating in the Riemann 
Hypothesis. 

The definition of the gamma function goes back to Euler, who initially defined it as 


the infinite product 
1 = 1\< z\-l 
r@ =- 1+-) (1+) 
(z) I] [ + *) sae 


n=1 
This infinite product converges for all z € C\ {0, —1, —2,...}. It defines a differentiable 
function on that domain, but we lack the techniques needed to prove this. Instead, we 
use another formula due to Euler, who showed that when rez > 0 this is equal to the 
integral 


(oe) 
T2a= / te dt (6.17) 
0 
We take this as our definition, in order to illustrate complex integration. Here we define 
i f= lm f 
a—>oo [0. [0,a] 
so the interval [0, co] is interpreted as the non-negative real axis. It is straightforward 


to verify that (6.17) converges, because the factor e~’ tends to zero rapidly. With this 
definition, the formula (6.17) defines I(z) only for zin the positive half-plane 
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Ct ={zeC:rez>0} 


We establish some basic properties of '(z) using methods from complex integration. 
First, we prove that the usual method of integration by parts generalises from real 
analysis to the complex case: 


THEOREM 6.38 (Integration by Parts). Let f,g:D — C be differentiable functions on 
the domain D. Then for any path y : [a,b] > D we have: 


[w = (fel? - [re (6.18) 
y y 
Proof. By Proposition 4.4 (iii) we have 
Gg) =f's+fe' 
so 
fs’ = (fa) —f'8 


Now integrate both sides over y and use the Fundamental Theorem of Contour 


Integration, Theorem 6.33. 
Now we prove the basic functional equation for the gamma function: 
THEOREM 6.39. Ifz € Ct then 
Pz+)l =z) rd)=1 (6.19) 
Proof. By (6.17), noting that z+ 1 € Ct, we have 
e,) 
T@+bH= / edt 
0 


Now use integration by parts to rewrite this as 


CO 
(z+ 1) = [-te~"]° +f ztele'dt 
0 


CO 
= lim (—x*e* — (0e°) 4 <f feat 
x—>>CO (0) 


CO 
=0+ zf tear 
0 
= 2 (z) 


To calculate I'(1), observe that 


CO 
r(1) = i tle—dr 
0 


= (=e 
=0-(-1) 
=1 


By induction, this formula leads immediately to a curious result: 
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COROLLARY 6.40. Ifn € N then 


Tm) =(n- 1)! 
In effect, the gamma function is a sensible way to define factorials for complex 
numbers, in such a manner that the basic formula (n + 1)! = (n + 1)n! continues to 
hold. 


Known Properties of the Gamma Function 


We record (but do not prove) several other properties of the gamma function. One is 
Euler’s reflection formula 


rd —zr@)= 


sin Zz 
which is a consequence of his infinite product definition and a standard infinite product 
for the sine. Setting z = } we obtain 


r 1y\\2 _ I = 
WG) sin 2/2 - 
Therefore 
(5) = Vr 


The duplication formula states that 
Pert 5) = 2) fal (22) 


The derivative of the gamma function is obtained from the integral formula (6.17) by a 
technique called ‘differentiating under the integral sign’, and leads to 


CO 
ro=| te" log t dt 
0 


(see Chapter 7 for the complex logarithm). This also proves that I is a differentiable 
function in CT. 
Let y be Euler’s constant, defined by 


y = lim (1 | : : fees ! logn) ~ 0577216 
n—->oo 2 3 n 
Then 
(1) =-y 


Weierstrass found another infinite product that works for all z € C \ {0, -1, —2,...}: 


re-[0 +9] 


It is also possible to successively extend Euler’s formula to rez > —1,re z > —2, and 
so on, using (6.19) in the form 


T(z) 
z—l 


Tz—-D= 


140 


6.12 


Integration 


¥ y 
or Nw 


Figure 6.11 Graphs of (from left to right) the absolute value, real part, and imaginary part of the 
gamma function. Here x runs horizontally, y runs into the page, and the value of the function 
concerned runs vertically. 


provided z— 1 # O. This defines on the whole of the complex plane, except for 
the points {0,—1,—2,...}, and shows that the extended function is differentiable on 
its domain. It also explains why negative integers and zero must be excluded from the 
domain, because I would become infinite there; that is, have a pole. This procedure is 
an example of analytic continuation, which will be introduced in Chapter 14. 

Graphs of |['(z)|, reP'(z), and im T(z) are shown in Figure 6.11, for this extended 
domain. 

The gamma function, and its poles, turned out to be an important ingredient in a rad- 
ical new development in complex analysis: an unexpected link with number theory. In 
the hands of Riemann and his contemporaries, this led to major advances in number the- 
ory, and a conjecture that remains one of the biggest unsolved problems in mathematics, 
the Riemann Hypothesis (see Chapter 17). 


The Estimation Lemma 


We now return to more general questions about complex integration, and prove a very 
useful technical result. Some results in mathematics are servants of the theory, of little 
intrinsic interest in themselves, perhaps even a little dull, yet quietly figuring in impor- 
tant decisions all the time. ‘Always a lemma, never a theorem’. One such result is the 
lemma we are about to prove. It is a simple idea, giving an upper bound for the size of 
an integral | i f| in terms of an upper bound on |f| and the length of y, but it arises 
time and again in the theory, applying subtle pressure at critical points in the proofs of 
important theorems. Perhaps not a theorem of great stature, it is certainly worthy of a 
name. It is: 


LEMMA 6.41 (Estimation Lemma). If f : D — C is continuous, y is a contour of 
length L, and | f(z)| < M for all zon y, then 


| | 
Jism 
y 
Proof. It is poetic justice that we have to give two different proofs, depending on 
whether the reader has read Sections 6.1—6.3. Version A is the more natural, but it 
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requires knowledge of Sections 6.2—6.3. For readers who took the short cut, Version 
B is provided. 

Version A. It is sufficient to prove the result for a smooth path y : [a,b] > D. Let P 
be the partition a = to < t) <--: <t,=)b, with t,_; < s; <t,. Then 


Yr v(t) = ¥ Gr) 


lr=1 | 


< fo) lv) — 1G 


r=1 


< ML(yp) 


where yp is the polygonal approximation to y with vertices y(fo),...,V(f). But 
L(yp) < L(y), so 
IS(P,f,¥)| < ML(y) (6.20) 


for all partitions P. 
Given any ¢ > 0 we can find a partition P, such that 


penn fs =e 
so 


fs <|S(Pef. YI te 
Y 


i 
fr 


and (6.20) gives 


< ML(y) +¢ 


for all ¢ > 0. Hence 


< ML(y) 


as required. 
Version B. First we establish 


b b 
| / (u(t) + mo < i: lu(t) + iv()|dt (6.21) 
a a 
for continuous real functions u, v. Let ris u(t)dt = X, i v(f)dt = Y. Then 


b 
i (u(t) + iv())dt = X + 1Y 
and 


b 
X?4+Y? = (X-iY)\(X +iY) = if (X —iY)(u(t) + iv(t))dt 


b b 
= / (Xu(t) + Yv(t))dt +i / (Xv(t) — Yu(t))dt 
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But X? + Y2 eR, so 
b 
/ (Xv(t) — Yu(t))dt = 0 
a 
and 
b 
be ae gee 1 (Xu(t) + Yv(t))dt 
a 
where the integrand Xu(t) + Yv(f) is the real part of (X¥ —iY)(u(@) + iv(H). So 


Xu(t) + Yv(t) < \(X —iY)(u(t) + iw))| 
= JX2 + Y\u(t) + iv(| 


From real analysis, 


b b 
i [Xu(t) + Yv(t)]dt < / Jf X + ¥2 |u(t) + iv(p)|dt 


which gives 
b 
X24 Y? </x2+4 y2 i |u(t) + iv(t)|dt 
a 
so that 
b 
J/xX2+Y2< : |u(t) +iv(t)|dt (6.22) 
a 
But 
b 
VX24+ YY? =|X+i¥|= i (u(t) + iv(t))dt 
a 
| 


| 
and with (6.22) this implies (6.21). 


Now 
{a= 


b 
i i wr on 


b 
< / Lr )lly/ldt by (6.21) 


b 
=a |My’(t)|dt by real analysis 
a 


= ML 


The Estimation Lemma will be used in a variety of ways. We now state two 
applications. 


PROPOSITION 6.42. Let y be a fixed contour and suppose that f varies. 


(i) If the maximum value of |f\| on y tends to zero, then Los — 0. 


More generally, if f — fo, then ys an I, fo- 
(ii) If the length of y tends to zero, the image of y tends to a point zo, and |f | remains 
bounded, then jy f—0. 


In particular this is the case if f is continuous at Zo. 
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Proof. To prove (i), observe that as the maximum value of |f| on y tends to zero, 
| i» f| < ML, which also tends to zero, so dod > 0. 

The second statements follows by considering f — fo. 

To prove (ii), first observe that if f is continuous at zo, then | f| is bounded in a neigh- 
bourhood of zp. So we may assume |/f| is bounded on y. Again, | if | < ML, which 
tends to zero. 


Consequences of the Fundamental Theorem 


If f is continuous and has an antiderivative F in a domain D, then we saw in Section 6.9 
that, for any contour y in D from Zo to z1, 


[rere — F(z0) 
y 


This result has interesting consequences. To explore them we need: 


DEFINITION 6.43. A contour y : [a,b] — C is a closed contour if y(a) = y(b). 


In real analysis, an integral ia f is always zero. In the complex case, if f has an 
antiderivative in its domain D, then an integral : f round a closed contour in D is always 
zero. (In the absence of an antiderivative, this need not be the case, as Example 6.37 
shows.) Moreover, if f has an antiderivative in D and zo, z; in D are different, it does not 
matter which contour y from Zo to z1 we use to integrate f: the result is always the same. 

These properties characterise the existence of an antiderivative: 


THEOREM 6.44. Let f be a continuous complex function defined on a domain D. Then 
the following conditions are equivalent: 


(i) f has an antiderivative in D, 
(ii) Lo = 0 for every closed contour y in D, 
(iii) ion depends only on the end points of y for any contour y in D. 


Proof. We have already established that (i) implies (ii) and (ii1). To show that (ii) implies 
(iii), suppose that v1, y are contours in D from Zo to z1, as in Figure 6.12. 


Then y; — y2 is a closed contour, so by (ii) — f =0. But 
[ or-fr-fs 
v-72 v1 2 
so 
f=] f 
v1 2) 


which is (iii). 
Finally, to show that (iii) implies (i), we fix z € D and for any z} € D we choose a 
contour y in D from Zg to z; and define 


F)= | f 
"1 
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Figure 6.12 Two paths with the same end points determine a closed contour. 


Because D is open, for some €; > 0, if |A| < e1 then the line segment A() = z1 + At 


(t € [0, 1]) lies in D, and 
Fa+n= [r+ fy 
VY Xr 


ee Ty. +t 


Thus 


For any constant c and contour y from z, to Zz», 


i cdz = (2 — 21) 
Y 


[ Paar 
x A 


F h)—F — 
(21 +h) — F(z) fla) = [> fe 4 
h a 


Now we use the Estimation Lemma. Continuity of f implies that, given ¢ > 0, there 
exists 6 > 0 (which we may take to be less than ¢;) such that 


SO 


Therefore 


Iz — z1| < 5 implies | f(z) —f(z1)| < € 
so that, when 0 < |h| < 6, for any z on the line segment A the integrand satisfies 
I(f(@) —f@1))/Al < €/IAl 
The length of A is |h|, so whenever |h| < 6, 


[5 “fa 4g] < an . 
~ (Al 
which implies 
F h)—F 
| PORN _Fepyfse (al <d) 
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But ¢ is arbitrary, so 
. Fy +h) — Fy) 
lim = 
h->0 h 


f(z) 


and F’(z1) = f(z1) for z1 € D, as required. 


Example 6.45. The function f(z) = |z| is not differentiable, so f. i f may (and as we 
show, does) depend on the choice of contour y between points. For instance, consider 
y,o from 0 to i given by 


y(t) =it (té [0,1)) 
and o = 01 + o2 where 


oi(t)=t (t € [0,1]) 
o(t)=e" (t €[0,2/2]) 
as in Figure 6.13. Then 


1 
/ |z|dz -|/ lit| -id¢t = [i?/2]g =i 
y 0 


but 


1 m/2 : : 
foetac= fo it-aaes [°° tel ie” ar = [P/2]p + [eG = b+i- Li 3 
o 0 0 


Figure 6.13 Two paths y and oj + 02 that yield different values of the integral of |z]. 


While complex functions that are merely continuous may be integrated using the basic 
formula, they offer little of interest. From now on, differentiable functions, and methods 
for integrating them, occupy centre stage. The resulting theory of integration has no 
natural real counterpart, because the real line lacks the flexible choice of path between 
points that occurs in the complex plane. 

With minor exceptions, this chapter completes the natural analogies between the real 
and complex theories of differentiation and integration. From now on, new possibilities 
will unfold. 
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Exercises 


1. Draw the contours y = [0,i],o = [0, 1] + [1,1]. Evaluate 


[ rezae [ rezaz 
y oO 


. Draw the contours y = [—i, i] and o where o (t) = e” (t € [—2/2, 2/2]). Evaluate 


i |z|dz i, \z|dz 
y o 


. Compute if Z+dz for y(t) = (1+ it (t € [0, 1]) using the formula 


1 
/ f(2dz = [ f(y@)y' dt 
Y 


and multiplying out the right-hand side and integrating the real and imaginary parts. 
If y = (0, 1] + [1,1 + i], what is f) z4dz? 


. Let n be a positive integer. If y(t) = zo + re” (t € [0, 2nz]) describe the contours y 


and —y geometrically. Compute 


/ : dz and / : dz 
y &— £0 -y &— 4 


. Fory@= e" (t € [0, ]) evaluate ty f for each of the following functions f: 


(i) 1/2? 
(ui) 1/z 

(ili) cos z 
(iv) sinh z 
(v) tanz 
(vi) (exp(z))° 


. Verify that the length function L satisfies 


L(-y) = Ly) Ly +o)=L(y)+L(o) 


where y,o are contours. 


. Let y(t) = zo + re" (t € [0,6]) be the arc centre zo, radius r, subtending angle 6. 


Using Proposition 6.11, verify that 
0 =L(y)/r 


which is the usual definition of ‘angle 0 in radian measure’. 


. Show that the length of the parabolic arc y(t) = at? + 2ait (t € [0,1]), where 


0<aeR,is 


a(V2 + log (1 + /2)) 


. If y is aclosed contour in C, the signed area enclosed by y is defined to be 


1 : 
21 Jy 
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Figure 6.14 What does the signed area represent for this contour? 


10. 


11. 


12. 


13. 


By writing the integral explicitly in the form [ MC —iv)(u' +iv), or otherwise, show 
that S is real. Show that its value for circular and triangular contours is + the usual 
area, and that both signs can occur. What is the geometric significance of the sign? 
What does the signed area represent for a contour such as Figure 6.14? 


Show that the signed area in Exercise may also be given by 


1 
s=— f imzae s= > | recae 
y Vy 


Write down the value of the integral i, im z, where y is the square y = [0,1] + 
[1,1 +i] +[1+i1,i+6,0). 


Let y(t) = e” (t € [0,1 /2]). Use integration by parts, Theorem 6.38, to compute 
(i) iy zsin zdz 

(ii) f, zeoszdz 

(ii) y ze dz 

(iv) if 2 sinzdz 

Let c(t) = zo + re! (t € [0,277]) be the circle centre zo, radius r > 0. If f is 

continuous in the domain D and z € D, use the Estimation Lemma to prove 


(i) lim / f(@dz = 0 
r—>0 C, 


(ii) lim I@ = 2mif (Zz) 
r—>0 Cc, — £0 
For each of the following functions f : D — C-, either state an antiderivative F : 


D — C or explain why an antiderivative does not exist. 
@ f@=2,D=C 

Gi) f@ = 1/22, D=C\ {0} 

(iii) f(z) = 1/z, D= C \ {0} 

(iv) f(z) =zsinz, D=C 

(v) f@) = lz?, D=C 

CigQ=z,d=C 
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Figure 6.15 Graph of x! for real x. 
14. Use Euler’s reflection formula 
r@rd-_= 


sin(z z) 


to find T(5). Use the identity "(z+ 1) = zI(z) to calculate ['(n + ) for alln € N. 
Gauss’s pi function has the value I(z) = I'(z + 1) and is known to be analytic 
inD=C\{neZ:n < O}. By translating the identity [(z+ 1) = zI(z) to 
T(z) = zII(z — 1) show that (0) = 1 and M(n) = n! forne N. 
Figure 6.15 shows the graph of x! for real values of x. Calculate the numerical 
values of (n + »! for n = 0, 1,2,3, —1, —2, —3, —4, and mark them on the graph. 


Angles, Logarithms, and the Winding 
Number 


So far, we have defined complex analogues of most of the basic special functions of real 
analysis — the exponential and trigonometric functions — and proved that many of their 
familiar properties extend to the complex case. Some new features also arise, such as 
the link between exponential and trigonometric functions and the periodicity of exp. 

There is one glaring exception to our catalogue of complex analogues: the loga- 
rithm. In real analysis, we define this as the inverse function to exp : R > Rt, so 
log : Rt — R. However, because exp is periodic in C, it is not one-one (injec- 
tive). Therefore it does not have an inverse function in the strict set-theoretic sense. 
This feature caused many of the historical disagreements about logarithms of nega- 
tive and complex numbers, exacerbated by the general view that log z is automatically 
meaningful when z € C, and all of its properties ought to generalise to C. 

Resolving these issues involves being far more careful when defining complex log- 
arithms. But the effort is worthwhile, because logarithms hold the key to many of the 
deeper and more powerful features of complex analysis. 

The problem of finding inverses to functions that are not bijections arises in real 
analysis as well. Inverse trigonometric functions such as sin~! are notorious for causing 
severe headaches, but the square root is the most obvious case. The square f(x) = x? is 
not injective as a function f : R — R. Itis also not surjective: negative real numbers are 
not squares of real numbers. The historical resolution was to define two square roots for 
every positive real number, +./x. Each is a one-sided inverse: (+,/x)* = x when x > 0. 
But neither is an inverse on the domain of f, which is all of R: their images are the 
positive reals and the negative reals, respectively, so they are not surjections onto R. To 
obtain genuine inverse functions, we restrict the domain of f to the non-negative reals, 
when ./x is an inverse, or to the non-positive reals, when —./x is an inverse. These two 
domains meet only at the origin. 

It would be natural to try the same trick in the complex case. But now there is no very 
natural way to restrict the domain and codomain to make the logarithm a bijection. A 
variety of more or less artificial choices exist, such as the ‘cut plane’ Cz where the neg- 
ative real axis is deleted, and these have their uses. To obtain a natural description of the 
complex logarithm, though, we use a more geometric approach. In classical terms, the 
complex logarithm is ‘multivalued’ — its value is not uniquely defined. The way in which 
its multiplicity of values fit together is closely analogous to the way that the measure- 
ment of an angle in radians gives not a single real number, but an infinite list differing 
only by multiples of 27. Indeed, these two instances of non-uniqueness are closely 
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related. In particular, the well-known complications when defining inverse trigonomet- 
ric functions in real analysis also arise because these functions are 27 -periodic, hence 
not injective. The same issue has to be resolved to make sense of the complex logarithm. 

We discuss these ideas below, and apply them to a topological invariant known as 
the winding number of a path relative to a point z) € C. In essence, this measures the 
total angle relative to zo traversed by a point moving along the path continuously from 
beginning to end. When divided by 27, this angle tells us how many times the path 
winds round the point zo. The winding number proves to be extremely useful in the 
deeper parts of the subsequent theory. 


Radian Measure of Angles 


We first relate the power series definitions of sine (5.7) and cosine (5.8) to the usual 
geometric ones, with the angle being measured in radians. This completes the proof that 
the power series represent the traditional trigonometric functions in the real case. 

Let 0 € z € C and write z in polar coordinates as z = re?, (r > 0). Since? = 
cos@ +isin0, Table 5.1 in Section 5.5 implies that for fixed r, as 0 increases from 0 
to 2/2, the real part of z decreases monotonically from r to 0 while the imaginary part 
increases monotonically from 0 to r. Since Iz|? = r7(cos? @ + sin’ 6) = r’, it follows 
that z traces out the first quadrant of a circle of radius 7 (in the anticlockwise direction) 
as 9 increases from 0 to 2/2, Figure 7.1 (left). 

Similarly, as 9 increases from 7/2 to 7, the point z traces out the second quadrant 
of a circle of radius r; from z to 37/2, it traces out the third quadrant; from z to 
37/2, it traces out the fourth; and thereafter it continues to go round the circle in the 
anticlockwise direction, Figure 7.1 (middle). 

We now compute the arc length from 1 to z along the circular path. For 6 > 0, let 
y(t) = re”, (t € [0,6]). Then y is a contour from 1 to z along the relevant arc. The 
length of y is 


Figure 7.1 Left: re’ traces out the first quadrant of a circle of radius r as 9 increases from 0 to 

a /2. Middle: For 6 € R, rel? repeatedly traces out the circle of radius r in the anticlockwise 
direction. Right: Traditional geometric definitions of sin@ and cos @ agree with the power series 
definitions. 
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8 6 8 
L(y) = i ly’(t)|dt = i lire” |dt = i rdt = rd 
0 0 0 


Thus 6 = L(y)/r, which is the standard definition of ‘angle measured in radians’. 
Figure 7.1 (right) then shows that the geometric definitions of sin @ and cos 9@, in terms 
of a right triangle with an angle @ radians, agrees with our analytic definition via power 
series. 

The 27-periodicity of sin and cos proved in Proposition 5.5 shows that this agreement 
extends to angles greater than 27, and to negative angles, which of course are measured 
in the opposite sense round the circle: clockwise. 


The Argument of a Complex Number 


We now look in more detail at the expression of a complex number z in the form re’’. 
Here r = |z|, so it is unique. However, when r = 0 any value of 6 leads to z = 0. 
And when r > 0 there are infinitely many possible values of 0, differing only by integer 
multiples of 277. Recall that the unique value in the range —z < 0 < m is called the 


principal value of the argument of z, and is denoted by 
arg Z 
This defines a function 
arg: C\ {0} :> R 


The function arg is not continuous on the negative real axis. This is a result of the need 
to choose @ uniquely: just above the axis @ is close to 7, just below it is close to —z. 
We can sidestep this problem (inelegantly) by defining the cut plane 


C, =C\{x«+iy:y=0,x <0} 


as in Figure 7.2. 
We claim that arg is continuous in the cut plane. This is plausible geometrically, but 
we must provide a rigorous proof. There is one technical difficulty: the bad behaviour of 


Figure 7.2 The cut plane C,. 
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inverse trigonometric functions (also caused by periodicity). We circumvent it by using 
several overlapping domains, on each of which the behaviour is sufficiently good. The 
proof that follows is an inelegant ‘bare hands’ reduction to properties of real functions: 
for a more elegant approach see Section 8.4. 


PROPOSITION 7.1. The function arg is continuous in the cut plane Cy. 


Proof. Let 
Di ={x+iyeC:y> 0} 
Dz = {x +iye C:x > 0} 
D3 = {x +iye C:y < 0} 
Then 


C =D; UD, UD; 


(draw a picture!) and in each of these domains we have an easy way to find the value of 
arg Z. 
To do so, observe that if 
Z=xt+i= el? 


and we wish to solve for r, 0, then 


Par ty sors x2 +9? 
and 


cos @ = ————— 
pe) +y 


sind = 
+ ye 


From properties of sin and cos it follows that if z € Dj, there is a unique solution for 0 
with 0 < @ < z. In this range cos is monotonic strictly decreasing and continuous, so 
its restriction to (O, 7) has a continuous inverse function 


cos! : (-—1,1) > (0,7) 


For z € D, we have 
_y fez 
argz = cos > — 
Iz| 
which, being a composition of continuous functions, is continuous. 
Similarly if x € D2 then —2/2 < 0 < 1/2. In this range, sin increases monotonically 


from —1 to 1, so it has a continuous inverse 


ed —m 1 
sin: : > a) 


Now 


which is continuous on D>. 
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Figure 7.3 The cut plane Cy. 


Finally, on D3 we can use 
cos! : (—1,1) > (—7,0) 


which is a different choice of inverse cosine from that used in D; because we are 
inverting the restriction of cos to a different interval. Again arg is continuous on D3. 
Since D,,D 2, D3 are open, arg is continuous in D} UD, U D3 = C,. 


It is sometimes convenient to proceed more generally. Let a € R, and let Ry be the 
ray 
Ry = {re :r >O0}CC 
Let 
Cg = C\ Ra 
as in Figure 7.3. Choose 
O=arg,z (z€ Cy) 
by the rule 
z=re®,r>0,a—-2n <0 <a 


Then by a similar method it may be shown that arg is continuous in Cy. 


The Complex Logarithm 


The complex logarithm poses similar problems, because the complex exponential 
function is not one-one. 
We wish to define log z, for 0 4 z € C, by 


w=logz if z=e” 


(We exclude z = 0 because e” is never zero, by Proposition 5.4.) 
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Let z = re’? 


deliberate!) We assume r > 0, —a7 <0 <27,s00 = argz. Then z = e” becomes 


,w = u+iv. (The mixture of polar and Cartesian coordinates is 


eae (7.1) 
Taking moduli, 
r=e" (7.2) 
Since r > 0 and r,u € R this has the unique solution 
u=logr 


where log is the real natural logarithm. Then (7.1) and (7.2) imply that 


ai? — 
so that 
v=0+4+2nn (néEZ) 
Hence 
logz = w= logr +10 + 2nz) 
or 


log z = log |z| + i(arg z+ 27) (7.3) 


The complex logarithm is thus ‘multivalued’ and therefore not a function in the strict 
set-theoretic sense. To obtain a genuine (single-valued) function we define the principal 
value of the logarithm to be 


Logz=log|z| +iargz (04zEC) 


(Note the capital ‘L’ for the principal value.) This function is not continuous on the 
negative real axis. However, for z € C,, the real and imaginary parts of Log are clearly 
continuous, so Log is continuous in the cut plane. 

Now we can compute the derivative of the logarithm. We have 


Log z — Log zo 


d ; 

— Logzo = lim 

dz L>% Z— ZO 
W— Wo 


w>wo e¥ — e0 


setting z = e”, 7 = e”9 and using continuity of Log. Thus 


1 ee od 
_—— = lim = —expwo =e? 
da Log z w>wo W— Wo dz 
dz 0 
and 
d ” 1 1 
—Logz = — = — 
dz oe e” =z 


and in general 


d 1 
Logz = (z€ C,) 
dz Zz 
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In the same way, in the cut plane C, we can define 
log, z = Log |z| + iarg, z 


and this is continuous, with derivative 


d 1 
= logy z =-— ¢eC,) 
dz Zz 


Once the logarithm is defined, powers 2 for complex a can be defined, in a cut plane, 
by reducing them to exponentials. The principal value of z*, when z £ 0, is 


2“ = exp(a Log z) 


Exercises 9-14 of Section 7.9 explore this function. For a more global view of z% see 
Chapter 14. 


The Winding Number 


Suppose that y : [a,b] — C \ {0} is a closed path. The choice of codomain here ensures 
that the path does not pass through the origin. If we imagine f, the parameter, to be time, 
increasing from a to b, and choose the argument 6(f) of y(¢) to vary continuously with 
t, then as y(t) moves along the path, then intuitively the total change in argument is the 
number of times that y winds round the origin (anticlockwise), multiplied by 277. (If y 
is not closed this can also be defined, but it need not be an integer.) 

We wish, of course, to make this idea precise: 


DEFINITION 7.2. A continuous choice of argument along a path y : [a,b] > C \ {0} 

(which for full generality we no longer require to be a closed path) is a continuous map 
6 : [a,b] — R such that 

io) — YO 

ly(@| 

Condition (7.4) merely says that 0(f) is one of the possible values of the argument of 

y(t). 


(7.4) 


Before proving that a continuous choice of argument exists, in Theorem 7.4 below, 
we consider a simple example. 


Example 7.3. Let y(t) = re” (t € [0,7 ]), Figure 7.4. 

A continuous choice of argument is 6(f) = t, because obviously re'/|re’| = el? = 
re'°(), But this is not the only possible choice: 6(f) = t + 27 works equally well, or 
indeed 0(f) = t + 2nz for any fixed n € Z. 

What we cannot do is ‘change horses in midstream’, say by choosing 


t (t € [0, 2 /2]) 


ls t+2n (te (x/2,7)) 


If we do that, condition (7.4) holds, but 6 is not continuous. 
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Figure 7.4 A semicircular path. 


The difficulty (though it does not take much getting used to) in handling continuous 
choices of argument is that it does not in general suffice to insist on some simple recipe 
for choosing a value of the argument, such as keeping angles between 0 and 27. Such 
a choice becomes discontinuous if the path winds round once anticlockwise, taking the 
argument to 27, and then continues in the anticlockwise direction. Then the argument 
jumps discontinuously back to near 0 when we want it to continue increasing from 27. 

It turns out that we can choose the argument at the starting point to be any of 
the infinitely many possible values; but this choice then determines all of the rest 
uniquely — and the arguments that arise need not stay within any predetermined range. 
It is intuitively obvious that this kind of choice can be made, and that it is unique once 
we decide where to start. Its proof is a little tricky: it uses the Paving Lemma. 


THEOREM 7.4. (i) Let y : [a,b] > C \ {0} be a path not passing through the origin. 
Then there exists a continuous choice of argument for y. 

(ii) Any other continuous choice of argument differs from this by a constant integer 
multiple of 27. 


Proof. By the Paving Lemma 2.33 we can subdivide y into finitely many subpaths 
yr (r = 1,...,n) such that each y, lies inside a disc D, C C \ {0}, Figure 7.5. If 
the centre of d; is at pre”, then taking a, = 6, +7, we find that D, C Cy, so arg, is 
continuous in D,. 

This gives a continuous choice of argument across y, for each r, but of course these 
choices need not fit together continuously. However, any choice of argument can be 
obtained from any other by adding a suitable integer multiple of 27. We can therefore 
adjust the arg, inductively so that they do fit together continuously, as follows. 

As usual, let y, be defined on the parametric interval [t,_1, t;]. There is an integer n2 
such that 


arg y, (t1) = arg, (t1) + 2n27 


Then there is an integer 13 such that 


atgy, (2) + 2mm = arg,,(t2) + 2n37 
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Figure 7.5 Paving a path. 


and so on inductively, choosing n+, so that 

argy (tr) + 2n-m = age, (tr) + 2nyp410 
Then we define 

O(t) = argy, (tr) + 2ny2m (t € [ty-1,tr]) 


with n, conventionally defined to be 0. Then @ is continuous. This proves that a 
continuous choice of argument exists. 
Next, suppose that 6* is another continuous choice of argument. Then we must have 


6*(t) = 0(t) + 2n()x 


where n(f) is an integer, possibly depending on ¢. But 


_ OW) — (0) 


nm 20 


is continuous, yet takes integer values. It is therefore constant. 


Note that a continuous choice of argument is defined on the parametric interval [a, b], 
not on the image of y. This means that if the path returns to the same point, with y(t;) = 
y(t2) but t; # tz, the arguments O(f;) and 0(t2) may be different. Intuitively, it is clear 
that this will happen if the path winds round the origin a non-zero number of times 
between f) and fo. 


Example 7.5. Let y(t) = re*”" (¢ € [0, 1]). 

A continuous choice of argument is given by 6(f) = 4at + 2nz, for any choice of 
n€ Z. Although y () = y(t+4) fort € [0, sh the choices of argument 6(t) and 0(t+ 5) 
differ by 27, because the path has travelled once round the origin (anticlockwise) 
between ¢ and ¢ + a 
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More significantly, y(0) = y(1), but the difference in arguments is 477, because in 
total the path has travelled twice round the origin (anticlockwise) between t = 0 and 
t=1. 


We can take advantage of this idea: 


DEFINITION 7.6. The winding number of a path y : [a,b] > C \ {0} round the origin 
is 
0(b) — O(a) 


WO) Oe 


for a continuous choice of argument 0 along y. 


By Theorem 7.4 (ii) the winding number is well defined; that is, any continuous choice 
of argument gives the same value. 

For any closed path, the winding number is an integer, because 0(b) — O(a) is an 
integer multiple of 27. 

The winding number takes the sense of the path into account: anticlockwise turns 
count as positive, clockwise turns count as negative. 


Example 7.7. Let y(t) = re~" (¢ € [0, 6]). 

A continuous choice of argument is 0(t) = —t. Then 
_ O(67) — 00) _ 
20 
and y winds three times round the origin in the clockwise direction. 


w(y,0) 3 


The winding number is additive: 


THEOREM 7.8. Let y, and y2 be two paths in C \ {0} such that the end point of y, is 
the start of y2. Then 


wiv + 2,0) = w(1, 0) + w(y2, 0) 


Proof. We may assume that 1, v2, and yj + y2 have parametric intervals [a, b], [b,c], 
and [a, c] respectively. Let 6 be a continuous choice of argument on 1) + y2. Then 


A(c) —0 

WY1 + y2, 0) =o“ 
6(b) — 0 

w(71, 0) a 

8(c) — O(b 

w(72, 0) =o) 


This is an extremely useful result, because it lets us compute the winding number of 
a complicated path by breaking it up into nice pieces and adding their contributions. It 
extends easily to show that 


WY] +-°+ + Y%.0) = w(1,0) +--+ + w(ym, 9) 
and that 
w(—y,0) = —w(y,0) 
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The Winding Number as an Integral 


If the path is smooth, we can specify its winding number as an integral along the path. 
Additivity of the winding number extends this formula to contours. First, consider a 
closed contour. 


THEOREM 7.9. Let y be a closed contour. Then 
1 1 
w(y,0) = ; —dz (7,5) 
ami Jy Z 


Proof. Subdivide the path into subpaths 1,..., y, as in Theorem 7.4, whose notation 
we now use. Each y, lies in a cut plane Cy, . If y, is defined on the interval [f-—1, t;] then 


1 
/ a = log, v(t-) — logy, v(t-—-1) 


r 


= log |y (¢,)| — log |y(G-1)| 
= i(argy |y(t-)| — argy, lv (r—-1)) 


As in Theorem 7.4 we make sure that argy, y(t-) = alge, Y (tr), that is, the choices 
of argument agree where the subpaths join together. Then, summing the integrals for 
r=1,... ,n, the real parts cancel out because y is a closed contour, and the imaginary 
parts add up to 27 w(y, 0), as required. 


The Winding Number Round an Arbitrary Point 


There is nothing very special about the origin as far as winding numbers are concerned. 
If y : [a,b] — Cis a path and z € C does not lie on y, we can define the winding 
number of y around zo. The easiest way to do this is to translate the origin, setting 


Mt)=yQ)—z (t€ (a,b) 
and calculating 
w(7Y,Z0) = w(T’, 0) 


For a closed path y, we get 


1 1 
wy, Z0) = [ 7& 


27i 
1 fre 
il, TH 
1 b / 
= | y@ dt 
21 Ja Y(t)— Zo 
1 1 
Oni yr-% 


This leads to: 
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DEFINITION 7.10. If y : [a,b] > C is a closed path and zp € C does not lie on y, 
then the winding number of y around Zo is 


dz (7.6) 


( ) = : 
WI ; —| : 
ue? 271 Jy Z— Z0 


The additivity of w(y,0) in Theorem 7.8 extends easily to w(y, zo) for fixed but 
arbitrary zo € C, using the same trick of translating the origin. 


Components of the Complement of a Path 


We consider how the winding number w(y, zo) of a given closed path y can vary as zo 
varies. 

By Proposition 2.41, the complement S of the image of y is open, and also each con- 
nected component of S is open. The winding number w(y, zo) is defined for all z) € S, 
and it is an integer-valued function on S. It is geometrically plausible that this function 
is constant on any connected component of S. We prove this analytically by showing 
that w(y, Zo) is a continuous function of zo € S. The desired result then follows, since 
any integer-valued continuous function is constant on any connected set. 

The proof that w(y, zo) is continuous in Zo is obtained by a direct estimate. Fix z € S. 
Since S is open there exists k > 0 such that |z1 — zo| < k implies z, € S. Therefore if z 
lies on the image of y then |z— zo| > k. Thus if |z1 — zo| < k/2 we have |z— z1| > k/2. 


Now 
[| ; | 
dz 
yLe— 2-21 


1 i Zl — Z0 | 
= dz 
2m \Jy (@—zozZ— 21) | 


“ |z1 — Zol 
mt k2 


1 
Iw(y, 20) — w(y,21)| = 
20 


by the Estimation Lemma 6.41. 
Given any ¢ > 0, take 6 = min(k/2,7k?¢/2L(y)). Then 


|z1 — zo| < 6 implies |w(y,z0) — w(y, 21)| < € 


Hence w(y, zo) is continuous in zo on S. 


Example 7.11. The path in Figure 7.6 has the winding numbers shown, around points 
zo in the components of S to which those numbers are assigned. The set S has a single 
unbounded component (Proposition 2.41) which we denote by O(y) (where the ‘O’ 
stands for ‘outside’). 

As Figure 7.6 shows, if zo € O(y) then w(y, %) = 0. This is easily proved from the 
integral formula (7.6), as follows. Let zp be ‘far from the image of y’, that is, assume 
lz — zo| => K for large K. Then the Estimation Lemma 6.41 shows that 


lw(y,Zo)| < L(y)/20K 
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Figure 7.6 Winding numbers for components of the complement of a path. 


which tends to zero for large K. But the left-hand side is a non-negative integer, so it 
must be equal to zero for large enough K. Since it is constant on O(y), it must be zero 
on the whole of O(y). 


7.8 Computing the Winding Number by Eye 


The somewhat complicated definition of the winding number may give the impression 
that it is complicated to calculate. This is not so, at least for paths that are contours made 
up of smooth subpaths. For contours the calculation is simple: trace your finger along 
the contour and count the number of turns. The point of this section is that this process 
can easily be formalised so that, in this case, what is obvious is also true. 


Example 7.12. We start with a path y whose image is a rectangle with vertices +2 + i, 
Figure 7.7. 


Figure 7.7 A rectangular path. 


If you want a formula, it is not hard to give one; for example let 


2—i+2it (t € [0, 1]) 
$27--4G-—1) el) 

2+i-2i(t-—2) (t €[2,3]) 
—2-i+4¢-3) (@e[3,4) 


yo= 
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This path is composed of four standard ‘line segment’ paths, one for each edge, Sec- 
tion 2.4. As the parameter t varies, the corresponding point moves along each edge at 
constant speed. (With our definition of a line L(#), the speed along an edge depends on 
the length of that edge because ¢ is taken in the interval [0, 1].) 

We want to calculate w(y, 0). 

First, here is how not to do the calculation. 

Break y up in the most natural way, into subpaths y, = y|,,-1,7 for r = 1,2,3,4. 
Then use the integral formula 


4 
1 1 1 1 
= d = _— 
w(y,0) ake Zz ami 2 7k 


Now (to take one subpath) 


1 oy 
i ae = / YOu 
n = 0 y(t) 
= [log(2 —i + 2i)}) 
= Log (2 + i) — Log (2 — i) 


because y; lies in C,, so the principal value Log is continuous on y;. Then 


Log (2 £1) = log |2 +1| +iarg(2 +1) 
= log V5 + isin“! (1/5) (7.7) 


SO 


/ rae = isin”! (1/5) 
YI & 


where the inverse sine is chosen between —z/2 and 7/2. 

You now have three similar integrals to evaluate. Add them, divide by 2771... , and do 
some very careful bookkeeping on the domains of the inverse trigonometric functions 
that occur. It can be done; in a sense it is not even difficult — but it is hardly to be 
recommended. 

It is a little better (but not much) to work from the ‘continuous choice of argu- 
ment’ definition. This starts you at stage (7.7) above for each subpath, with the same 
bookkeeping problems at the end. 

Here is a more civilised method. Divide y into subpaths 61, 52 as in Figure 7.8. 

Now w(y,0) = w(61,0) + w(62,0). Since 6 lies entirely within the cut plane C,, 
the principal value of arg is continuous on 6,. Hence, dotting all i’s and crossing 
all t’s, 


w(6,,0) = [arg(i) — arg(—i)]/27 
= [2/2 —(—1/2)]/22 
1 


2 


Similarly, 52 lies entirely within the cut plane Co, so argy is continuous on 52. Hence 
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Figure 7.8 A more civilised method. 


w(52,0) = [argo(—i) — argo(i)]/ 27 
= [-—2/2 — (—3m/2)]/20 


Adding, w(y,0) = 1. 


Clearly this process can be telescoped: the arg calculations merely confirm, in a 
predictable way, the obvious. 
We summarise this method as follows: 


(1) Break y into convenient pieces, each lying in some cut plane. 

(2) For each piece, compute the contribution to the winding number as the difference 
between the arguments of the end points, using the relevant continous arg — that is, 
arg, on Cy — or, geometrically, find the angle subtended at the origin by the two end 
points of the subpath, with the appropriate sign. 

(3) Add these contributions. 


It helps if the dissection of y into subpaths is performed by drawing a single 
straight line through the origin, because then the contributions are always +5. Thus 
in Figure 7.9, the line Z cuts the path y into four segments AB, BC, CD, and DA. 


Figure 7.9 Cutting the path along line L makes the calculation of the winding number easy. 
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Figure 7.10 A more complicated example where cutting the path along a suitable line makes the 
calculation of the winding number easy. 


Now 


w(y,0) = w(AB, 0) + w(BC, 0) + w(CD, 0) + w(DA, 0) 
1,1l,1,1 
=aragtagts 
=2 
Similarly, the path in Figure 7.10 gives 


w(y,0) = w(AB, 0) + w(BC, 0) + w(CD, 0) + w(DE, 0) + w(EF, 0) + w(FA, 0) 
+5 +0+(-5) +045 


re Ne 


This method is essentially the mathematical equivalent of ‘run your finger along the 
path and count half-turns’. The ingredients needed to make it rigorous are not compli- 
cated evaluations of args. The real crunch comes in showing that the path really does 
cross the chosen line L at the points A, B, C, and so on; that it crosses the line only at 
those points; and that it passes through these points in the stated order; and that the sense 
(clockwise or anticlockwise) of each subpath is as in the diagram. It is also worth bear- 
ing in mind that a smooth path may cross a line infinitely many times, so this method 
is not foolproof. But whenever (as is always the case in the sequel) y is specified in a 
straightforward way — such as by a simple formula, a polygon, or a series of straight 
lines and circular arcs — these ingredients are easily supplied. We then make no further 
fuss when we assert that a certain winding number takes a particular value. 

Comparison of the first ‘bad’ method in Example 7.12 with the final ‘good’ one gives 
a striking illustration of the dangers of blind ‘formula-crunching’. Complex analysis is 
highly geometric, and the geometry should not be despised. 


7.9 Exercises 


1. Compute the principal values of the arguments of the following complex numbers: 
1444 +184 93,4 +iByrsa +274 +i’. 
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2. Let arg z denote the principal value of the argument of z # 0, that is, —m < 
arg z < 7. For real x, y with x < 0 show: 
(i) limy_,o arg(x + ily|) = 7 
(ii) limy_,o arg(x — ily|) = —2 
Compute the corresponding limits when x = 0 and when x > 0. 


3. Compute the following principal logarithms: Log (3i), Log(—2i), Log(1 + i), 
Log (—1), Log (z!) where z = 2e!”/3, Log (x) for real x # 0. 
4. For 71,22 4 0 show that 


Log (z1z2) = Log (z1) + Log (z2) + 2nzi 


where n is an integer that need not be zero. Specify the values that n may take. Show 
that a logarithm of z; z2 is of the form 


log z1 + log 2 


provided that appropriate values of the logarithms are taken. 


5. The ‘Bernoulli paradox’ is as follows: 


(-y? =z 
hence 
2 log(—z) = 2 log(z) 
so 
log(—z) = log(z) 
What is the fallacy? 


6. Let f : C > C be given by f(z) = e!’z for constant real 6. Show that f rotates the 
complex plane through an angle 6. (Hint: prove that it is a rigid motion, that is, it 
satisfies | f(z) —f(Z2)| = |z1 — Z2| for all z1,z € C, and that it fixes the origin and 
only the origin. Then consider the value of f(1).) 

Show that the transformation f(z) = iz rotates the complex plane through a right 


angle, and describe the effects of the transformations f(z) = —z,f(z) = —iz as 
rotations. 
For any complex number A = re? describe the transformation f(z) = Az in 


geometric terms. 


7. In each of the following cases, for z = re!, draw the locations of z and 1/z in C: 
(i) 3ei7/2 
(ii) 2e!”//4 
(ii) fe'"/ 
(iv) 3e717 
Describe the transformation f(z) = 1/z in geometric terms. 
8. Let n be a positive integer. A complex number a is said to be an nth root of unity if 
ao" = 1. 
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10. 


11. 


12. 


13. 


14. 


(i) Find all nth roots of unity in polar coordinates and draw a picture. 
(ii) For n = 2,3,4, express the nth roots of unity in the form x + iy. 
(iii) If w1,@2 are nth roots of unity, show that the following are also nth roots of 
unity: 


m 
@; @02 w/a 


(iv) For given r,@ € R, (r > 0), find all z € C such that 


(v) Ifz? = 25, show that z} = wz2, where is an nth root of unity. 
For z, B € C,z £0, define the principal value of z’ to be 
a = exp(f Log) 


Compute the principal values of the following powers: 
1v2 (—2)¥2 joi B- Aj) B+ Ai)> 


Using the notation of Exercise 9, for z € C,, show that d(z’)/dz = Bz?-!. For 
fixed a € C,,, what is d(a*)/dz? 


Let a € R,B € C. Forz € Cy, define 
(2), = exp(B log, z) 


Compute all possible values of (ia, vare (3 - 4i)'*"), for various values of a. 
For fixed z, show that as a varies, (2°), takes only finitely many values when B 
is a rational number. 
If m,n € Z,n > O, show that 


(2! ")y yt = 2 


What is ((2”)""/")q? 


Using the notation of Exercise 11, for z € Cy, compute d((z8 Ja)/dz and 
d((*)e))/ dz. 


Describe the image of the functions f : C; — C geometrically where f(z) is given 
by the principal values of the following: 


Leta € R,6B € C,z €C,. By writing 6 = u + iv, find 
Jo} and arg (z" Jo 


Show that |(z*),.| is independent of if and only if £ is real. 
For positive integers m,n let f(z) be the principal value of z”/”. Describe f : 
Cy, — C geometrically. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 
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Express Log (1 + z) as a power series in z of the form 
ee) 
Log(1+z)=}Jaqz" (lel < R) 
n=0 


specifying the coefficients a, and the radius of convergence R. 


Show: 

(i) cos(—iLog (z+ Vz? — 1)) =z 
(ii) sin(—i Log Gz + Vz? — 1))) =z 
(iii) tan = Log — =2Z 


1 1 1 


Use these properties to express cos z, sin” ~ z, tan” z in terms of the logarithm. 


Show that all the values of cosh! z have the form 
cosh”! z = log(z + V2 — 1) 


for all possible values of the logarithm and square root. In the same sense, show that 
sinh! z = log(g + Vz + 1) 
Ler 
tanh! z= } log ( —— 
2 g 1— Zz 
LetD={zeC: nas € C,,}. Describe D geometrically. Define the principal 
1—Z 
value of the inverse tangent tan~! : D > Cto be 
tan! z= SiLog (=) 
1—Z 
(note the principal value of the logarithm). Show that tan~! is differentiable in D 
with derivative 1/(1 + z*). 


Draw the following paths and specify all the continuous choices of argument along 
them: 
(i) y(t) = 2e-¥ (te [0, 47) 
Gi) y@) =t+id — 4) (Ee [0, 1)) 
(iii) y(t) =t —1+if (t € [-1,1]) 
. t+il—f” (te [0,1)) 
as 1+it—1) (e[1,2) 
In each case compute the winding number of the path round the origin. 
Find the winding number w(y, z) for each of the following choices of y, Zo: 
(@) y(t) = 2e (te [0,27 ]); zo=1,3i 
di) y(t) =t+id —d (te [0,1])); zw=1+4+1,-i, 101 
For each of the following, draw a picture of the path and use a sensible method to 
compute [e 1/(z — Z0)dz: 
@) y=te" (re [x,52]) wo =0 
Gi) y= —it(re (0,1) w=! 
Gi) yO =it CE L-1L1) w=! 
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: 


Figure 7.11 Compute (by eye) the winding numbers of these closed paths round a point in each 
component A, B,C,.... 


8 (S} 


(iv) yO =ot+ [1,2] + 0+ [-2,-l] ~=0 
where a(t) = e(*—9, (t € [0, 2]) and p(t) = 2e-" (¢ € [0, 47r]). 
22. Compute (by eye) the winding number of the given closed paths round a point 
in each of the connected components A, B, C,...of the complement, drawn in 
Figure 7.11. 


Cauchy’s Theorem 


The Fundamental Theorem of Contour Integration, Theorem 6.33, tells us that if D is a 
domain and f : D — C has an antiderivative in D, then the integral of f along a path in 
D from zp to z; can be calculated using the formula 


fy = F(z1) — F(Zo) 
Y 


In particular, if y is closed, so z} = Z%, then 


[p= 


Cauchy’s Theorem goes further. It states conditions under which . f = 0 when there 
is no initial reason for f to have an antiderivative. There are many different versions of 
Cauchy’s Theorem — or, to be precise, many different theorems of this type. Cauchy was 
the first to publish such a result, announcing it in 1813 and getting it into print in 1825. 
Gauss was aware of the basic idea in 1811, but the accolade goes to Cauchy because he 
was the first to make it public. 

Both Gauss and Cauchy realised the basic fact that if y does not wind round points 
outside D (that is, the winding number round such points is zero) then i f = 0. For 
instance, f(z) = 1/z has the single point 0 outside its domain, so if the closed contour y 
does not wind round the origin, J f = 0, Figure 8.1. 


Figure 8.1 The integral of 1/z along this closed path is zero. 
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Figure 8.2 A star domain. 


Our main aim in this chapter is to establish the following version of Cauchy’s 
Theorem: 


If f is differentiable in a domain D and w(y,z) = 0 for all z ¢ D, then / f=0. 
y 


We start the theory rolling by proving the special case where the contour is a triangle. 

Then we prove a theorem that requires a restriction on the domain D rather than the 
contour y. we say that D is a star domain if it contains a point z, such that for any other 
point z € D the line segment [z,, z] C D, Figure 8.2. We then define F(z) = Stee 2] f 
and use the triangle version of the theorem to show that F is an antiderivative of f. This 
means that i f = 0 for any closed contour in a star domain. 

In particular, a disc is a star domain, and this leads to a very significant result. For a 
differentiable function f in a general domain D, we may not be able to find an antideriva- 
tive F : D > C. But if we restrict attention to any disc A C D, there is an antiderivative 
F : A — C. Thus an antiderivative may not exist globally throughout D, but it does 
exist locally in any neighbourhood N,(z) C D for any zo € D. 

Using the Paving Lemma 2.33, any contour y in an arbitrary domain D can be written 
as y = vy; +---+ y, where each subcontour y, lies in a disc D, C D. In the (star 
domain) D, we can choose a step path o; with the same end points as y, and (by the 
existence of an antiderivative in D,), f. oe = i f. If we seto = o, +-+-+0n, then 


Lf= ite Figure 8.3. 


Figure 8.3 Reduction of a contour integral to an integral along a step path. 


This reduces the investigation of [ , f to the case of an integral along a step path o, 
which can be attacked by geometrically inspired methods. 


8.1 


8.1 The Cauchy Theorem for a Triangle 171 


The Cauchy Theorem for a Triangle 


At the end of the nineteenth century, amongst many different versions of Cauchy’s 
Theorem, a most ingenious proof for a triangular contour was conceived by Eliakim 
Hastings Moore. Earlier proofs usually insisted that the function f should have a contin- 
uous derivative f’. By restricting the contour to a triangle, Moore’s proof requires only 
that f’ exists throughout D. It therefore provides a suitable basis for the development of 
the theory for all differentiable functions. 

For z1, 22,23 € C, let T(z1, z2,z3) be the set of points inside and on the triangle with 
vertices 21,22, 23. Formally, 


T(z1,22,73) = {2 €C 2 Z=Ayzy + Anz +A3z3,A; € R, Aj = 0,A, +A2 +A3 = 1} 


where j = 1, 2, 3. 
The boundary contour of the triangle, composed of the three line segments that form 
its sides, is 


OT (Z1, 22,23) = [z1, 22] + [22,23] + [23,21] 
Whenever there is no confusion we denote the triangle by T and its boundary by dT. 


THEOREM 8.1 (Cauchy’s Theorem for a Triangle). Let f be a differentiable function in 
a domain D. If the triangle T lies in D, as in Figure 8.4, then i rf =9. 


Proof. Let | def) =c>0. 
We prove that c = 0 by an indirect argument. First we subdivide T into four smaller 
triangles TY, 72 7) TH by joining the midpoints of the sides as in Figure 8.5. 
We know that 
4 


/ f=> / i 
aT my var 
Therefore 
| [ | | [ | 
c= < 
ae! |= Naas 
Le / 
D > / 
f 


Figure 8.4 A triangular path in D. 
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Figure 8.5 Subdividing the triangular path. 


so we must be able to choose r such that 


Cc 
If les 
ro 4 


(If more than one r satisfies this inequality, choose any of those — say the one with 
smallest r.) Define 7; = T”. Then 


fh f > ; and L(8T;) = 4L(8T) 
1 


Repeat this process of subdivision to get a sequence of triangles 


T > T; > T> ated > T, ++: 


satisfying 


| | > (qe and L(8T}) = (5)"L@T) (8.1) 
aT n 

Next we get another estimate for | /, ar, /| using the fact that f is differentiable. Since 
triangles are bounded closed sets, the intersection of the nested sequence of triangles T;, 
contains a point zo € D. Since f is differentiable at zo), for any ¢ > 0 there exists 6 > 0 
such that 


0 < |z — zo| < 6 implies <€é 


a —F Go) 
j Zz 


— £0 


Hence 
0 <|z—zo| < 6 implies | f(z) — f(zo) — f’(Zo)(z — z0)| < €lz— Zol 
For some integer N, every point of 7; is within 6 of zo for all n > N. Thus 
If(2) —f(@o) —f’(zo(z — Z0)| < elz—z0| forallze7T,,n>N 


For z € Ty, we trivially have |z — zo| < L(0T;), so the Estimation Lemma 6.41 gives 
i Lf(2) — fo) —f' oe — zo) az < eL(0T,) - L(0T,) (8.2) 
aT n 


But —f(zo) — f(z )(z — Zo) is of the form a + nz, where a,b are constants. This has 
antiderivative az + 5b2, so (8.2) reduces to 
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ie 


Comparing this with the earlier estimate (8.1) to get 


< eL(8Tn) 


(be < [orf | $ 8%? = Gy"eL@m? 
giving 
c < eL(a8TY* 


But ¢ is arbitrary and c > 0, so we must have c = 0, that is 


f=0 
aT 

This theorem deserves a commentary, because its analytic presentation obscures the 
fact that the basic ideas are very simple and very geometric. One is that the integral of f 
is additive on contours. That is, the contributions from the subdivided contours add up 
to that on the original one. The other is that a differentiable function is approximately 
linear (that is, of the form a + bz) near any point. 

If it were possible to make it exactly linear, locally, then we could take a fine sub- 
division making it linear on each subcontour; get zero for the integral on each subcontour 
because there is an obvious antiderivative; then add all these zeros to get zero for the 
original integral. 

Unfortunately this is not possible, and we are faced with adding a larger and larger 
number of contributions, each getting closer and closer to zero. But by estimating the 
rates of growth and shrinkage, we can show that errors introduced by assuming approx- 
imate linearity tend to zero fast enough to compensate for the increasing number of 
subcontours. 

It is an interesting exercise to rewrite the proof in a way that turns this informal 
description of the argument into a formal proof that keeps the geometry to the fore. 


Existence of an Antiderivative in a Star Domain 


We begin with a formal definition, previewed in the introduction to this chapter: 


DEFINITION 8.2. A domain D is a star domain if there exists z, € D, called a star 
centre, such that for all z € D the straight line segment [z, z] lies in D. 

(A star centre need not be unique. For example, a disc is a star domain and every point 
in the disc is a star centre.) 


In a star domain there is an obvious candidate for an antiderivative of a function f, 
namely the integral F(z) = See . jf. We now show that this is indeed an antiderivative, 
by applying Theorem 8.1. 


THEOREM 8.3. If f is differentiable in a star domain D with star centre z,, then F(z) = 
tee af ts an antiderivative of f in D. 
ok 9 
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Figure 8.6 These three points in a star domain form a triangle inside the domain. 


Proof. The domain D is open, so for any z1 € D there exists ¢; > O such that 
Ne, (z1) € D. If |h| < €1, the triangle T(z,.,z1, Z1 + h) lies entirely in D, Figure 8.6. 
Now Theorem 8.1 gives 


es es 
[zx,21 ] [z1 z1-+h] [zi +h, zx] 


Fea+ [ f-Fla +h =0 
[21.21 +h] 


This can be written as 


or 


F()— Feith) 1 i 
h h [21,21 +h] 


The proof now proceeds in the same manner as Theorem 6.44. For a constant c € C, 


/ cdz=ch 
[21,21 -+h] 


PQ) =F th) 9) _ | f@ -fla) 4 (8.3) 
h [21,21 +A] h 


By continuity of f, given ¢ > 0 there exists 6 > 0 such that 


hence 


|z — z1| < 6 implies | f(z) — f(z1)| < € 
If z lies on the line segment [z1,z1 + /], 
|h| < 6 implies |z—z1] <6 solf(z)—f(z)| <¢€ 


The Estimation Lemma 6.41 gives 


| f@)~Fen 4] ge lapis, 
I Ez cite] h | IAI 


and from (8.3), if |h| < 6 then 


ae — Fat+h) 
h 


f(z) 


<€ 
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Since ¢ is arbitrary, 


fia) EE 56 


h>0 h 


sof =f. 


COROLLARY 8.4. [ff is differentiable in a star domain D, then Lf = 0 for all closed 
contours y in D, and the integral of f between any two points in D is independent of the 
choice of contour between the points. 


Proof. This is an immediate deduction from Theorems 8.3 and 6.44. 


An Example — the Logarithm 


In Example 6.37 we showed that the integral of 1/z from —1 to 1 depends on the path 
between those points. We now examine how to restrict the domain to get round this 
problem. Later we describe a more elegant approach. 

The function 1/z is differentiable in the domain C \ {0}, but this is not a star domain; 
moreover, there is no antiderivative on this domain. However, if we restrict the function 
to any star domain D C C \ {0}, the results of the previous section apply in D. It is easy 
to see that the cut plane C, is a star domain, with star centre z,. = 1. Figure 8.7 makes 
this clear geometrically, and a rigorous proof is straightforward. 

Because 1/z is differentiable in C, it has an antiderivative there, the principal value 
of the logarithm, which satisfies 


1 
Log z1 =) —dz 
[Lz] < 


We exploit the fact that this integral is independent of the path, provided the path is 
contained in C,, and integrate along a specially chosen contour that is not the line 
segment [1, zi]. Instead we proceed as follows. Let z1 = re” where r > 0,-m <0<7, 


Figure 8.7 The cut plane C,, is a star domain with star centre z, = 1. 
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Figure 8.8 A more convenient path to integrate 1/z in Cz. 


and define y = y; + y where y is the line segment [1,r] and y(t) = re" (t € [0,6]). 
See Figure 8.8. 


Then 
0 1 1 
Logre’ = —dz+ —dz 
nm % y2 % 
ri il 
-| —dt +f 
1 t 1 rev 
= logr +10 


This gives an alternative approach to the complex logarithm. In particular, it provides a 
far more satisfying proof of the continuity of the argument in the cut plane C,, than the 
prosaic version given in Section 7.2. Namely: the function Log is differentiable in C,, 
hence continuous, so its imaginary part (which is the argument of z) is also continuous 
there. 


Local Existence of an Antiderivative 


Let f be differentiable in an arbitrary domain D. We know that f may not have an 
antiderivative that works throughout D. But D is open, so for any zo € D there is an 
open disc N;(zo) € D. A function F : N-(zo) — C such that F(z) = Ff (2) for all 
z € N,(Z) is called a local antiderivative of f. A disc is a star domain, so Theorem 8.3 
tells us that f has a local antiderivative in every disc in D. 

This immediately simplifies integration of a differentiable function along an arbitrary 
contour, because we can integrate along a step path instead: 


LEMMA 8.5. If y is a contour from Zo to z in a domain D, then there exists a step path 
o from z to z in D such that i = [.,f for every function f that is differentiable in D. 


Proof. By the Paving Lemma 2.33, y = y, + --- + Yn where each y, lies in a disc 
D; © D. Let o; be a step path in D; from the initial point of y; to its final point. Then 
by Corollary 8.4, in the star domain D,, we have ea = (one Now o, = 0, +-::+ 0, 
is a step path from Zo to z; in D, as in Figure 8.3, and 


[rE [r-S [r= fs 


r=1 
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Cauchy’s Theorem 


We build up to Cauchy’s Theorem in stages. First, consider a rectangle 
R={x+iyeC:a<x<b,c<y<d} 
with boundary contour 
OR = [21,22] + [z2,23] + [23,24] + [24,21] 
where 71 =a+ic,2 =b+ic,73 =b+id,%4 =a-+ id, as in Figure 8.9. 
LEMMA 8.6. If D is a domain, f is differentiable in D, and R C D, then dont = 0. 


Proof. Insert the opposite diagonal contours [z,,z3] and [z3, z,]. Use Cauchy’s Theorem 
for a triangle, Theorem 8.1, twice, and add. The integrals along the two diagonal paths 


cancel. See Figure 8.10. 


Now we take an arbitrary closed step path o and insert extra line segments to make 
up a collection of rectangles. To do this, extend all horizontal and vertical line seg- 
ments of o to infinity, breaking the plane into a finite number of rectangles, some 


finite, Rj,...,R,, and some infinite, Ry41,..., Rm». (What we mean by ‘rectangle’ in 
ice 
R 
ide = 7 
* J 


Figure 8.9 Contour defined by the boundary of a rectangle. 


D 


Figure 8.10 A rectangular contour is the sum of two triangular contours. 
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R 


Figure 8.11 Extending the edges of a step path to tile C with finite or infinite rectangles. 


the infinite case is clear from the figure. A ‘side at infinity’ is missing.) Figure 8.11 
shows an example where k = 9,m = 25. 
In the interior of each R, choose a point z,, and define 


Yn = WO Zn) 


This is independent of the choice of z,, in R, because the interior of R,, is connected. 

Say that R, is relevant if v, 4 0. Then R, is relevant only when o winds round it. 
In particular, all of the infinite rectangles Ry+1,...,Rm are irrelevant, because they lie 
in the infinite component of the complement of o. (In Figure 8.11 the only relevant 
rectangles are R3, Rs, R7, Rs.) 

We now demonstrate that o can be expressed in terms of the boundary contours of 
relevant rectangles, by taking v, copies of each boundary dR, when v, > 0, and —vy, 
copies of each —dRy when vn < 0. 

For instance, in Figure 8.11 we take —dR3, 0Rs5,0R7, 0Rg. Cancelling the opposite 
segments common to 0R5,0 Rs and to 0R7, 0Rx we finish up with the step contour o. 

To show this works with an arbitrary closed step path o, it is convenient to use the 
notation ny to represent n copies of y when n > 0 and —n copies of —y when n < 0. 
The most straightforward procedure is to start with the set of contours 


A = {-—o0,v0R1,..., eRe} 


and show that the cancellation of opposite line segments L, —L, wherever they occur, 
removes them all. 

To prove this, suppose that when as many pairs as possible have been cancelled, 
There remain g 4 0 copies of some line segment L. Then Lis a side of at least one finite 
rectangle R;, and by allowing q to be negative if necessary, we may suppose that L is 
traced in the same direction as dRs. Let Ry be the rectangle on the other side of L (which 
may be a finite rectangle or an infinite one), as in Figure 8.12. 

The set of closed contours 


B=AU {—q@R3} 
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Figure 8.12 Rectangles either side of L. 


then simplifies to have no copies of L. If we compute the winding numbers of the con- 
tours in B round z, € R, and z, € R,, then the absence of L from the simplified set of 
contours tells us that the two winding numbers are the same. But the winding number 
round Z, is 


vyw(OR1, Zs) + +++ + vRWCORK, 25) — W(O, Zs) — qwORs, Zr) = —q 
and around z;,. it is 
vy WOR], Z-) + +++ + VEWOORK, Z) — WG, 2) — QwOOR;, Z,) = 0 


Hence q = 0 as required. 
This confirms that o may be obtained by taking v, copies of each relevant contour 
dR» and deleting opposite line segments wherever they occur. 


LEMMA 8.7. Let o be a closed step path in a domain D such that w(o, z) = 0 for all 
z € D. Then, for any function f differentiable in D, we have A ot = 0. 


Proof. We express o in terms of relevant rectangles, as above, and show that every 
relevant rectangle R,, must lie inside D. By definition, if z lies in the interior of R, then 
w(o ,Z) = Vn # O0soz € D. On the other hand, a point z on the boundary dR» either lies 
on o (and hence in D) or it is in the same component of the complement of o as points 
in the interior of R,, whence w(o, z) = vy ~ O and againz € D. 

By cancelling contributions along opposite contours, 


[r= La 


Integrals on the right need be considered only when v, 4 0, in which case R,, is relevant 
so lies inside D. Then, by Lemma 8.6, 


f=0 


ORn 


and [ f =0. 


We now reach the focal point of this chapter: 


THEOREM 8.8 (Cauchy’s Theorem). Let f be differentiable in a domain D and let y be 
a closed contour in D that does not wind round any points outside D (that is, w(y, z) = 0 
when z ¢ D). Then Ge = 0. 
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Proof. By Lemma 8.5 there exists a step path o in D such that I, o= L @ for any 


function ¢ that is differentiable in D. In particular, t= i f. 
For z ¢ D the function ¢(z) = 1/(z — Zo) is also differentiable in D, so 


1 1 
w(o, zo) = safe = ml? = w(y,Z0) = 0 


By Lemma 8.7, 1 fae Hence Lf =f =0. 


Applications of Cauchy’s Theorem 


The version of Cauchy’s Theorem that we have just proved has far wider applications 
than simply showing that integrals round certain closed contours must be zero. It lets us 
calculate non-zero integrals as well. For example, suppose that y; and y2 have the same 
winding number round all points outside D, so that w(/1, z) = w(y2, z) when z ¢ D. Let 
z1 be the point where y; begins and ends, and let z be the point where y begins and 
ends. Take any contour o from z to z in D, Figure 8.13. 


D 


Figure 8.13 Creating a closed path from two paths. 


Let y = yi +0 — ~ — Oo. This is a closed contour in D, and w(y,z) = 0 for 
z € D because the winding number is additive. By Cauchy’s Theorem, 1, f = 0. 


Therefore 
[r+ fe-[o- [r=0 


and 


If we wish to compute i jf, we may be able to find another contour j as above, for 
which ‘bs f is simpler. 

The technique of introducing opposite contours o ,—o whose contributions eventually 
cancel is also very useful. In particular, it lets us prove a much more powerful Cauchy- 
type theorem: 


8.6.1 
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Figure 8.14 Joining a set of closed contours to form a single closed contour. 


THEOREM 8.9 (Generalised Version of Cauchy’s Theorem). Suppose that y1,...,Vp 
are closed contours in a domain D such that 


WV1,2) +++: +Wn,z)=0 forallz gD 
and let f be differentiable in D. Then 


Proof. Suppose that y, begins and ends at z; (1 < r < n). Choose any zo € D and 
contours 01,...,0,1n D that join zg to 1, ...,Z, respectively, Figure 8.14. 
Then 


Y HOY OF + On + Yn — Op 
is a closed contour beginning and ending at zo, and 
w(y,z)=0 forallz gD 
(again, by additivity of the winding number). By Cauchy’s Theorem, [. , f = 0, so 
n 
vfrtfr-fr=0 
r=1 ° Or Yr Or 


Therefore 


yf r=0 
r=1 "Yr 


Cuts and Jordan Contours 


Given two closed contours y,,y2 in D and a contour o in D from a point on y; to a 
point on y, we call the pair of contours o ,—o acut from y; to v2. There is a historical 
reason for this name. Earlier versions of Cauchy’s Theorem were invariably proved for 
Jordan contours. A closed Jordan contour is a closed contour y : [a, b] + C that does 
not cross itself. That is, 


a<t, < tz < b implies y(t) ¥ y(t2) 


182 


Cauchy’s Theorem 


It is intuitively obvious, but difficult to prove analytically, that every closed Jordan con- 
tour y separates the plane into two components, the points O(y) outside y and the points 
I(yv) inside y, and that these are both connected sets, Figure 8.15. 

Early versions of Cauchy’s Theorem state that if y and (7) lie in D, then [ 7 f =0.In 
applications it was then necessary to introduce ‘cuts’ to manufacture Jordan contours. 
For instance, suppose that f is differentiable everywhere except at z) and two Jordan 
contours yj, 72 both wind once round zp as in Figure 8.16 (left). Two cuts are made in 
Figure 8.16 (right) to create two new Jordan contours so that f is differentiable inside 
each of them. The integral round each new Jordan contour is then zero, and cancelling 
the contributions from the cuts we get ie f= i f. Such methods usually relied on 
geometric intuition, sometime unsupported by analytic proof. 

By introducing the winding number to link analysis to geometry, such pitfalls can be 
avoided, and there is no longer any necessity to restrict the theory to Jordan contours. 
Instead we can define the inside of any closed contour to be 


I(y) = {z€ C: wy, z) £0} 
and the outside to be 


O(y) = {zE C: w(y,2 = 0} 


Figure 8.15 Closed Jordan contour and its inside and outside. 


Figure 8.16 Left: Two Jordan contours. Right: ‘Rewiring’ the contours using cuts. 
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OY, 
Oe) 


Figure 8.17 Inside and outside components of the complement of a closed contour. 


In general, neither (vy) nor O(y) need be connected. For example, in Figure 8.17, O(7) 
has two components O; and O2, and /(y) has three, 11, 2, and i. 

The Jordan Contour Theorem (which we do not prove, but see Exercise 9 below for 
the step path case) then says that the outside and inside of a closed Jordan contour are 
both connected. 

For an arbitrary closed contour we can rephrase Cauchy’s Theorem as stated in 
Theorem 8.8 to get: 


THEOREM 8.10. Let f be differentiable in a domain D. If a closed contour y and its 
inside I(y) lies in D, then Le =0. 


Simply Connected Domains 


Theorem 6.44 states that if D is a domain and f : D > C is differentiable in D, then 
i f = 0 for all closed contours y in D if and only if f has an antiderivative in D. We 
can now state the precise conditions under which this happens for all functions f that 
are differentiable in D. 


DEFINITION 8.11. A domain D is simply connected if w(y,z) = 0 for every closed 
contour y in D and every z ¢ D. 
Equivalently, [(y) C D for every closed contour y in D. 


Intuitively, a domain is simply connected if it has no holes. 
We now have: 


THEOREM 8.12. If D is a domain, then Jot = 0 for all closed contours y in D, if and 
only if D is simply connected. 


Proof. If D is simply connected then Cauchy’s Theorem implies that if f = 0 for any 
closed contour y in D and any f : D > C that is differentiable in D. Conversely, if 
D is not simply connected, there exists a closed contour yo in D and z € D such that 
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w(yo, Zo) # O. Let O(z) = 1/(27i(z — zo)). Then @ : D > C is differentiable in D and 
S, & = wo, 20) # 0. 


Exercises 


1. State which of the following are star domains, proving the existence of a star centre 
for those that are, and justifying your answer for those that are not. 
@) {z€C:z4%x+0i where |x| > 1} 
Gi) {ze C: |z| > 1} 
(iii) {2 € C:z 4 e* fort € [0, z]} 
(iv) {z€ C: |z| > 1 and either imz > O or rez > 0} 


2. Let D = C \ {0}. For z € D, specify a local antiderivative in some neighbourhood 
of zo for each of the following functions: 


(i) 1/z 

(ii) 1/2? 

(ii) (+ D/2 

(iv) (cos z)/z 

(v) (sinz)/z 

3. Let 

v(t) = —1+ hel (¢ € [0, 27]) 
yo(t) = 1+ Je" (t € [0, 2]) 
y(t) = 2e~" (t € [0, 27]) 


If f(z) = 1/(z? — 1) use Theorem 8.9 to deduce that 


[y . — oI 


Interpret this statement in terms of the winding numbers of y, y1, y2 round 1, —1. 


4. Show that D = {z € C : z # £1} is not simply connected. Let 
Lj=f+ieC:y=0,x< —-l} 


I2 ={x+iye C: y=0,x> 1} 

Do = D\ {L; UL2) 
Show that Do is simply connected. Is it a star domain? Does f(z) = 1/(z2 — 1) have 
an antiderivative in Do? In each case justify your answer. 


5. Prove that every quadrilateral whose edges do not cross is a star domain. What about 
pentagons? Hexagons? 

6. Let D = {z€ C: z # +i} and let y be a closed contour in D. Find all the possible 
values of i 1/(22 + 1)dz. If o is a contour from 0 to 1, find all possible values of 


f, We + Daz. 
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7. Let y; = 8; +L—S2 —L,y2 = 8; + L4+ Sz —L, where 


10. 


Si(t) = e” (t € [0,27]) 
S2(t) = 2e" (t € [0, 27r]) 
L= [1,2] 
Describe the inside and outside of y; and 7». 
Let f(z) = (cos z)/z, By writing cos z as a power series and considering f(z) = 


(1/z) + g(z), or otherwise, compute /, i f and f. 6 f. Compare the results with 
Theorem 8.10. 


. Let D = {2 € C: 2 AZz,...,2 # zx} where z € C, and suppose that f is 


differentiable in D. Show that for any closed contour in D, 


where S; is a sufficiently small circle centre z, and n, is an integer. 
If lim,_,,,(z — z,)f(z) = a, € C for r= 1,...,k, show that 


k 
[i = S° 2xinya, 
Y r-1 


. Jordan Contour Theorem for Step Paths. Define a crossing of a closed step path 


o : [a,b] > D tobe a point o(t) in the image of o, such that 
o(t1)=o(h) (@<t <t, <b) 


Say that o is simple if it has no crossings. 
Prove the Jordan Contour Theorem for step paths: Let o be a simple closed step 
path in C. Then 


(i) C \ G has precisely two connected components. One (call it J(7)) is bounded, 

the other (call it O(c )) is unbounded. 

(ii) Either w(o,z) = 1 for all z € (0), or w(o, z) = —1 for all z € I(o). 

di) If z € O(c), then w(o, z) = 0. 

Cauchy’s Theorem for a Simple Closed Step Path. Prove Cauchy’s Theorem for 

a simple closed step path o: If o does not wind round any point not in D, then 

io. 

Hint: One way to do this is as follows: 

(1) Construct a set of rectangles R,; as in Section 8.4. 

(2) Prove that a rectangle R, is relevant if and only if it lies in J(c). 

(3) Prove that J(0) C D, observing that by definition J(o ) winds around any point 
in I(o). 

(4) Prove that /, R, f = 0, by observing that R, is a star domain. 

(5) Define a closed path t, round the boundary of each rectangle, having the same 
winding number as o round any point inside the rectangle. (This is either always 
1 or always —1 by Exercise 9 (11).) 
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(6) Show that 


nt ad 


because edges of the R, that are not contained in the image of o cancel is pairs. 
(7) Deduce Cauchy’s Theorem for o. 
11. Suppose that 01, 0) are simple closed step paths in C, and that the image of oj is 
contained in J(02). Prove that J(o,) C I(o2). 


9.1 


Homotopy Versions of Cauchy’s 
Theorem 


The results in this chapter are not essential for any later parts of the text, except for 
Chapter 16. 


Cauchy’s Theorem is one of the foundation stones of complex analysis. It resolves 
two apparently conflicting features of contour integrals is f when f is fixed but y can 
change. On the one hand, y can be altered in fairly drastic ways with no effect on 
the integral — for instance, replacing a general path by a step path. On the other hand, 
changing a semicircular path in the upper half-plane of C into a semicircular path in the 
lower half-plane of C completely changes the integral of 1/z between —1 and 1, as in 
Section 6.10. 

The version of Cauchy’s Theorem in Theorem 8.8 explains these features: what really 
matters is the winding number of y around points that lie outside the domain D of f. 
This result emphasises the topology of the domain, and how the path lies within it. 
To improve our understanding, we examine these topological issues in more detail. In 
fact, we do so in two ways. In this chapter we consider continuous deformations of 
y, captured by the topological notion of homotopy. In Chapter 16 we discuss a related 
concept, homology. Both concepts formalise the idea that the domain D has ‘holes’, 
and the integral depends on how the path wanders around the domain, relative to these 
holes. However, they do this in two different, though related, ways. Homotopy is easier 
to visualise and geometrically quite natural. Homology is algebraically simpler, once 
some initial difficulties have been overcome, but it can appear artificial and contrived. 

We reformulate Cauchy’s Theorem from these two viewpoints. Each offers fresh 
insights. We also show that an arbitrary path y in a domain D can be replaced by a 
suitable approximating polygon without changing the integral of any function that is 
differentiable in D. This generalises Cauchy’s Theorem so that it applies to any closed 
path, removing the “piecewise smooth’ condition required of a closed contour. 


Informal Description of Homotopy 


Homotopy is one of the central ideas in topology. It describes topological features of 
spaces in terms of families of continuously varying paths. For example, consider the 
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Seevereererreue 


Figure 9.1 Deforming the semicircular path y) continuously into jy, . 


anticlockwise semicircle yo in C from 1 to —1 defined by 
vot) =e" (t € [0,7] 

and the clockwise semicircle y; in C from 1 to —1 defined by 
u@Q=e™" (te [0,7) 


Figure 9.1 shows visually that we can deform y into y; by displacing it vertically. In 
formulas, let 


ys(t) = (1 — s)e” + se“ 


Then yo and jy; are the two semicircular paths, and the path ys, varies continuously as s 
varies continuously from 0 to 1. 

However, if the paths are not allowed to pass through the origin (for example, they 
may be paths in the domain of a function that has a singularity at the origin, such as | /z), 
this continuous deformation is not permitted, because y;/2 passes through the origin at 
t = 7/2. A little experiment strongly suggests that no continuous deformation from 
yo to y1 exists if no intermediate path meets the origin. The origin is an obstacle to 
deforming the path, and any attempt to do so causes the path to get ‘hung up’ at the 
origin. The origin creates a hole, and the path cannot cross the hole. 

This suggestion is in fact true, but a proof requires some care: it can, for example, 
be based on properties of the winding number. Indeed, the winding number is a simple 
example of a homotopy invariant: a way to distinguish topologically different spaces 
using notions of homotopy. 

In this chapter we apply homotopy ideas to complex integration by considering what 
happens to iy f when we allow the contour y to vary continuously. We derive precise 
conditions under which y can be deformed continuously without changing the integral. 
This can be done in two ways. One is to fix the end points zo, z; and allow the contour 
between them to be continuously deformed. The other is to perform a continuous defor- 
mation of a closed contour. In both cases the deformations must keep the path inside a 
domain D on which f is differentiable. Figure 9.2 illustrates these two possibilities. 
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D D 


Figure 9.2 Two types of continuous deformation of a contour. Left: Keeping end points z;,z2 of y 
fixed. Right: Deforming a closed contour y. 


Both of these methods are special cases of a single result, The Cauchy Theorem for 
a boundary, which is proved in Section 9.3. Before this, in Section 9.2 we show how 
the conditions on y can be relaxed when f is differentiable to define [ y f along any path 
(recall that a contour is piecewise smooth). This lets us vary y freely without having to 
worry whether the intermediate paths involved are contours. 


Integration Along Arbitrary Paths 


Suppose that f : D — C is differentiable in the domain D. Up to now we have usually 
integrated such a function along a contour in D — a piecewise smooth path. However, 
because differentiable functions, are fairly well behaved, the precise path taken turns 
out not to be so important. We saw this in Lemma 8.5, where we were able to replace 
a contour by a step path and get the same result. We can use the same technique to 
define an integral along an arbitrary path y : [a,b] — D. This definition agrees with the 
Riemann integral if that happens to be defined — an application of Cauchy’s Theorem 
that we leave to the interested reader. 

The Paving Lemma gives a partition a = ft < t) < --- < t, = b such that each 
subpath y, defined on [¢,—1, t,] lies ina disc D, C D. Let 4 be the approximating polygon 
A =A, +---+An, where A, is the line segment from y(¢,_1) to y(t). Then A; lies inside 
D, for all r, so 4 lies inside D, and we can define the integral of f along y to be 


y x 
See Figure 9.3. 


This definition can be proved to be independent of the choice of the partition that 
determines A, provided these are chosen using the Paving Lemma as described above. 
For if extra division points are introduced between f,_1 and t;, say 


f-1 <Sj <-:+<5s, <t 


then y(si),..., v(sx) € D so the integral of f along A, has the same value as that along 
the polygon with vertices y(t,_1), v(51),---, (sx), v(t,), by Theorems 6.33 and 8.3. 
See Figure 9.4. 
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Figure 9.3 Replacing a continuous path by a polygonal one. 
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Figure 9.4 Subdividing a polygonal path inside a disc in D. 


Thus a finer partition of [a,b] does not change the integral. Given any two polygonal 
approximations A, 4 obtained using the Paving Lemma, let v be the polygon whose 
vertices are those of 4, 1 taken together. Then f, f = ff = Jf),f, so the definition of 
the integral is independent of the polygonal approximation — provided that is chosen 
using the Paving Lemma. 

An arbitrary polygonal approximation to y will not do. In Figure 9.5, suppose that 
D = C \ {zo}. The polygon 2 with vertices z1, 22, 23,24, 25, 26, 27 iS an approximation 
to y found using the Paving Lemma, but the polygon with vertices z1, z6, z7 is not. If 
f =1/@—z) then f.f — J, f = 27i, 80 [if # Jf. 

It is straightforward to check that Theorems 8.8, 8.9, 8.10, and 8.12 of Chapter 8 
also hold for arbitrary paths, however wild — even ‘space-filling’ curves, Section 2.9. 
For instance, if f is differentiable in D and does not wind round points outside D, then 
ie f = 0. With such results at our disposal we can widen the scope of our ideas and 
introduce general notions from topology. These provide further insight into the vari- 
ous versions of Cauchy’s Theorem and how they relate to each other. We return to the 
topological theme and develop it further in Chapter 16, using results from Section 9.3. 
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Figure 9.5 A different polygon can lead to a different value for the integral. 
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Figure 9.6 A rectangle and the path formed by its perimeter. 


The Cauchy Theorem for a Boundary 


The formal definition of homotopy (Definition 9.7 below) uses a map from a rectangle 
into C. The top and bottom edges of the rectangle determine two paths, and parallel 
segments of its interior determine a continuous deformation of one path into the other. 
We therefore begin by considering maps defined on rectangles, and properties of their 
boundaries. 

Let R be the rectangle 


R={x+iyeC:a<x<b,c<y<d} 
with boundary contour dR, Figure 9.6. (Here a < b andc < d are real.) Suppose 
that dR : [0,p] — C is parametrised by arc length, measured anticlockwise, where 


p = 2(b — a) + 2(d — c) is the perimeter of R, Figure 9.7. 
Given a continuous map ¢ : R —> C, we define the boundary of ¢ to be 


IPD = POR) (te [0,p]) 
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OR(1) 


Figure 9.7 Parametrising the perimeter by arc length. 


Figure 9.8 Boundary of the map ¢ in Example 9.2. 


Example 9.1. If @(x+iy) = xe’ (a <x < b,c < y <d) then 0¢ =T, +12 4+1934+T4 
as in Figure 9.8. 


REMARK 9.2. Here we are using notation in a flexible way by writing I’; for the image 
of Ij. Technically this is sloppy, but it avoids undue proliferation of ‘image’ symbols. 
Once we understand that a path differs from its image, we have earned the right to be 


sloppy. We exert that right repeatedly. 


Example 9.3. Let R = {x +iy:0<x<2,0 <y <2}. Forx+iye Randx+y>1, 
define ¢(x + iy) = x +1y, and forx+ iy € Randx+y < 1, define #( + iy) to be the 
reflection of x + iy in the line x + y = 1. Then the effect of ¢ is to fold over the bottom 
left-hand corner of R as in Figure 9.9. This shows that the image of 0¢@ need not be the 
boundary of the image $(R). 


However, Figure 9.9 provides the clue that all points inside 0¢ lie in the image @(R). 
We prove this is true in general: 


LEMMA 9.4. If@ :R — Cis continuous, then I(0g) € $(R). 
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Figure 9.9 Boundary of the map ¢ in Example 9.3. 


Proof. Suppose to the contrary that there exists zo € [(d@) but zo ¢ @(R). Let D= C\ 
{zo}. Then D is a domain and @(R) C D. If f(z) = 1/(271(z— z0)) then f is differentiable 
in D and 


/ f = w(0@, 20) 
ag 


is a non-zero integer. 

Subdivide the rectangle R by its vertical and horizontal bisectors into four equal 
rectangles RY R®, RO, RO and let go” be the restriction of @ to R®. Then for 
r = 1,2,3,4 the boundaries 06 are all closed paths in D. Because 


Le-z 


r=1 ag” 


f and 7 f = wad”, 20) 
ag 


at least one of the four integrals is a non-zero integer. Denote the corresponding rectan- 
gle R™ by Rj and the restriction of ¢ to R, by 1. Similarly dividing R, into four equal 
rectangles and repeating the process yields a nested sequence of rectangles 


RDR,D-::-DR,2-:: 


where each /. 3¢,/ 1S a non-zero integer. 
The sequence of rectangles contains a (unique) point z} € R. For e = |6(z) — Zo], we 
have N,(#(z1) © D. By continuity of ¢@ there exists 6 > 0 such that 


PN 5s ($21) VR) S Ne) 


For suitably large N we have Ry C N5($(z1)), so d@y is a closed path in the disc 
N,(@(z1)). But f is differentiable in N,(@(z,)), a star domain,. So oat = 0 by 
Corollary 8.4, contradicting /, ads f being a non-zero integer. 


We can now prove an important variant of Cauchy’s Theorem. But first (having 
decided not to rely on the topological notion of compactness) we need a two- 
dimensional analogue of the Paving Lemma. 


PROPOSITION 9.5. Let R be a rectangle and let 6 : R — D be a continuous map, 
with D © C a domain. Let R be divided into n* smaller rectangles, by subdividing its 
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sides into n equal parts. Then for sufficiently large n, the image of every subrectangle is 
contained in some disc in D. 


Proof. The proof uses the same ideas as that of Lemma 2.33. Say that a rectangle is 
pavable if if it can be bisected repeatedly a finite number of times into closed subrect- 
angles, so that the image of every subrectangle obtained in this manner lies inside a disc 
in D. Otherwise, the interval is unpavable. 

Suppose the result is false. Then R is unpavable. Split it into four closed subrectangles 
by bisecting the sides. At least one these subrectangles, call it R;, must be unpavable. 
Bisect R, to obtain an unpavable rectangle R,, and so on. This sequence continues 
indefinitely because we are assuming the result false, so we get a nested sequence of 
rectangles 


RDIRDR2QD::: 


each half the size of the previous one. The intersection of these rectangles is a unique 
point zo € R. 
However, D is open so there is a disc N;(@(z)) C D. If k is large enough, Ry C 
Ne(@(zo)), a contradiction. 
Now take n = 2*. 


THEOREM 9.6 (Cauchy’s Theorem for a Boundary). [f¢: R — Dis a continuous map 
from a rectangle R into a domain D, as in Figure 9.10, and f is differentiable in D, then 


,=o 
ag 


Proof. Cut R up into smaller rectangles R,, by dividing it into n equal parts horizontally 
and vertically, where 1 <p <n,1 < q <n. By Proposition 9.5 this can be done so that 


each @(Rpq) is contained in a disc Dyg C D. See Figure 9.11. 
Let pq be the restriction of @ to Ryg. Then its boundary d@yq is a closed path in the 
disc Dnyg GE D, so 


f=0 


Inq 


D 


dp 


Figure 9.10 Effect of map ¢ on rectangle R. 
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D 
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Figure 9.11 Subdividing the rectangle R, and the effect of @. 


Adding up all the integrals for 1 < p <n,1 < q < m, and cancelling integrals along 
opposite paths inside @(R), we are left with 


f=0 
ap 


Formal Definition of Homotopy 


A ‘homotopy’ between two paths yo : [a,b] — Dandy : [a,b] — Dis, roughly 
speaking, a continuously varying family of paths ys : [a,b] — D, where s runs over the 
interval [0, 1]. At the start, s = 0 and y; = yo; at the end, s = 1 andy, = yy. 

How do we make this precise? Notice that the whole family depends on two variables: 
the parameter, s, and the original variable ¢ giving the position on the path. We can 
combine everything into a single function y of (¢, 5) if we set 


ys)=ys) (¢ €[a,b],s € [0, 1]) 


It is then natural to insist that y, as a function of two real variables, should be continuous. 
We are thus led to: 


DEFINITION 9.7. A homotopy in D between yo and 1, as above, is a continuous map 
y : [a,b] x [0,1] > D 
such that 


y(t,0) =yo(t) for all t € [a,b] 
y(t,1)=y,(t) for all t € [a,D] 


We illustrate this idea in Figure 9.12. Note that we do not (yet) assume that yo and y; 
have the same start or end points. 

It will prove convenient to think of [a,b] x [0, 1] as a subset of C, by identifying (¢, s) 
with ¢ + is. 

Figure 9.12 captures well the continuous variation of y,, but it is misleadingly nice in 
that y is one-one. There is no reason to require this in general, so a perfectly reasonable 
homotopy could resemble Figure 9.13. 
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y(B) ¥(C) 


y(A)—ésy 
oe D i y(D) 
e e 
a h 


Figure 9.12 A homotopy between two paths yo and 71. 


Figure 9.13 A homotopy that is not one-one between two paths yo and y}. 


Geometrically, of course, [a,b] x [0,1] is a rectangle, and its boundary is a closed 
path, with corners. If we parametrise the four edges of the rectangle in some way, and 
then use y to map the result into D, we obtain paths in D that join up to give a closed 
path, as for example in Figure 9.12. 

Our aim is to apply Cauchy’s Theorem for a boundary to this set of paths. Now, 
there is a problem: the two edges that give yp and j; are obviously useful, but the other 
two edges, marked p,t in Figure 9.12, are going to be a nuisance. We therefore add 
conditions that eliminate them. There are two obvious ways to do this: 


(a) Insist that each of p and t goes to a single point in D. 
(b) Insist that o and t cancel each other out. 


These two conditions yield two more restricted types of homotopy: fixed end point 
homotopy, and closed path homotopy. We describe these in detail in the next two 
sections. 
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Fixed End Point Homotopy 


Consider the rectangle 
R={t+iseC:te [a,bd],s € [0,1]}} 


DEFINITION 9.8. Two paths yo : [a,b] — D andy : [a,b] > D are fixed end point 
homotopic in D if there is a continuous map 


@:R—-D 
and points zo, z1 € C such that 


o(t,0) = yo(t) for all t € [a, b] 
d(t,1) = y(t) for all t € [a, b] 
&(s,0) = 2 for all s € [0, 1] 
o(s,1) = 21 for all s € [0,1] 


as in Figure 9.14. 


If we let y;() = A(t, s) then y, is a path in D from 2p to z; for all s € [0,1], andas s 
increases continuously from 0 to 1, the path y, “deforms continuously’ from yg to y;. 


Example 9.9. Let D = {z € C: |z| < 2},yo() = t@ € [-1,1),n®O = 


lit _ 


e2 1) (t € [—1, 1]). Then » and y; are fixed end point homotopic in D, where 


o(t,5) =A —s)wO+syi) @el—1,1],5€ [0, 1) 


0o@ ee 


Figure 9.14 A fixed end point homotopy. 
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Figure 9.15 A fixed end point homotopy between yo and 71. 


As a corollary of Theorem 9.6 we deduce: 


THEOREM 9.10. Iff is differentiable in a domain D and yo is fixed end point homotopic 
in D to y, then Sys a Teg f. 
Proof. We have a continuous map ¢ : R > D where 


o(t,0) = yo(t) for all t € [a,b] 
o(t,1) = y(t) for all t € [a,b] 
P(0, 5) = zo for all s € [0, 1] 
éU,s) = z1 for all s € [0, 1] 
as in Figure 9.15. 
If p, : [0,1] — D is the path p,(t) = z, for r = 0,1, whose image is a single point 
{z,}, then ei = 0 and 0¢ = yo+ pi — ¥1 + po. By Cauchy’s Theorem for a boundary, 


SO 


Closed Path Homotopy 


Once more we consider the rectangle 
R={t+is €C:té [a,d],s € [0,1]}} 
but we impose different conditions on ¢: 


DEFINITION 9.11. Two (closed) paths yo : [a,b] — D and y; : [a,b] — D are 
homotopic via closed paths in D if there is a continuous map 


@:R-D 
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such that 
(t,0) = yo(t) for all t € [a, b] 
ot, D=n@d for allt € [a, b] 
d(a,s)= o(b,s) forall s € [0,1] 


as in Figure 9.16. 


ie ——-—— seouneonee™ 
“see uceeeete* / 

- 

Dp t 


i 


Figure 9.16 Homotopy via closed paths. 


Again, if we define y,() = @(t, 5) then y, is a closed path in D for all s € [0, 1], and as 
s increases continuously from 0 to 1, the path y, ‘deforms continuously’ from yo to j1. 


Example 9.12. For |zo| < K, letD = {z € C: |z| < K,z # z}. For |z| < p < K, let 
yo(t) = pe" (t € [0,2z]), and for 0 < ¢ < K — |zo| let 1, (t) = z + ee” (t € [0,27]), 
as in Figure 9.17. Then yo is homotopic to y, via closed paths in D, where 


os)=A—s)y+snQ (te [0,27],s € [0,1]) 


Figure 9.17 Homotopy between the closed paths in Example 9.12. 
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THEOREM 9.13. If f is differentiable in a domain D and closed paths yo, y, in D are 
homotopic via closed paths in D, then as — t, f. 


Proof. We have a continuous map @ : R — D such that 


(t, 0) = yo(t) for all t € [a, b] 
A(t, 1) = y(t) for all t € [a, b] 
d(a,s)= (b,s) forall s € [0,1] 


Let o(s) = d(a,s) (s € [0,1]). Then d¢ = yo +o — x1 — o (soo is acut from yo to yy 
in the sense of Section 8.7). By Cauchy’s Theorem for a boundary, 


Ltafr+p- fs- [r=0 
free 


A closed path y in D is homotopic to zero or null-homotopic if it is homotopic via 
closed paths in D to 6 : [a,b] — D where f(t) = zo for all t € [a, b], with zo € D. We 
immediately deduce: 


SO 


COROLLARY 9.14. Let f be differentiable in a domain D and let y be a closed path in 
D that is homotopic to zero. Then { , f =0. 


The geometric significance of being homotopic to zero is that y can be continuously 
deformed to a single point — more precisely, a path whose image is a single point — as in 
Figure 9.18. 

We can use a homotopy to prove that the interval of a differentiable complex function 
is very well behaved under reparametrisation, of the path of integration. Here we can 
work with merely continuous paths and parameter changes. For simplicity we do not 
change the parametric interval. It is a useful exercise to include that possibility (intro- 
duce an affine map [a,b] — [c, d] and examine how that affects the integral). In the 
smooth case it is also possible to use the formula for a change of variable in an integral. 


Figure 9.18 A homotopy to zero. 
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PROPOSITION9.15. Let f be differentiable on a domain D, and let d : [a,b] > C be 
a path in D. Let p : [a,b] — [a, b] be continuous, and let y = i 0 p. Then Lat a aa 


Proof. Define a homotopy o : [a, b] x [0,1] > D by 
o(t,s) =A — s)t+ spp) 
Clearly o,(t) € D fort € [a, b]. It is easy to check that 


o(t,0)=y 
o(t,l)=’ 


Therefore o is a fixed end point homotopy from y to 4. Now Theorem 9.10 applies. 


Converse to Cauchy’s Theorem 


Starting with Cauchy’s Theorem for a boundary we have deduced first the homotopy- 
invariance of the integral for fixed end point or closed path homotopies, and then that 
the integral is zero for a path homotopic to zero. However, we can also start with Corol- 
lary 9.14 and argue the other way if we wish. First we need a simple result about 
changing the parameter of a path: 


PROPOSITION 9.16. Let y : [a,b] — D be a path and let p : [a,b] —> [a,b] bea 
continuous map such that p(a) = a, p(b) = b. Then y is homotopic to y © p. 


Proof. Define ¢ : [a,b] x [0,1] — D by 
p(t, s) = y((1 — s)t + sp(t)) 
This is a homotopy, and #(t,0) = y(t), (4, 1) = y(e(). 


Note that the reparametrisation p is not required to be a bijection here. Also: all paths 
ys have the same image. In effect we are performing a homotopy on the parameter. 
We can now prove: 


PROPOSITION 9.17. A closed path y in D is a boundary 0¢@, up to reparametrisation, 
if and only if y is homotopic to zero. 


Proof. The need to reparametrise the interval on which y is defined arises because of the 
way we have chosen a specific parametrisation for a boundary 0@. By Proposition 9.16 
we can adjust the parameter provided the images of y and 0¢ coincide. This reduces the 
proof to a geometric argument. We give the essence of the proof in a series of pictures, 
leaving the reader the (routine) task of analytic definitions and verifications required to 
make it rigorous. 

Define a map H : R —> R, where R is a rectangle, as in Figure 9.19. The definition 
proceeds in stages: 


(1) Identify opposite vertical edges to get a cylinder. 
(2) Squash the top rim to a point to get a cone. 
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; collapse top 


to a point 


open oul 


flat 


deform to a 


A rectangle 


Figure 9.19 Topological proof of Proposition 9.17. 


(3) Open the cone out flat to get a disc. 
(4) Distort the disc to get a square. 


Suppose that y is a boundary, say y = 0¢ where d : R— D. Then@doH:R — Dis 
a homotopy. The lower edge of R, marked by the heavy line in the first diagram, maps 
to (the image under ¢ of ) 0g. The top edge maps to a point. So y is homotopic to zero. 
Conversely, suppose that y is a closed path homotopic to zero. Then we can define 
a map of the cone into D such that the base circle goes to (the image of ) y. Therefore 
(reversing the last two steps in the definition of H) we can map R into D so that its 


perimeter goes to y. Hence, up to reparametrisation, y = d¢. 


The Cauchy Theorems Compared 


A common cause of distress for students of complex analysis is the sudden appearance 
of a plague of Cauchy Theorems, having several variant hypotheses and a similar variety 
of conclusions, but all begin derivable from each other. At times like this it may be 
advisable to seek consolation elsewhere than mathematics: perhaps among the poets. 
Rudyard Kipling’s Jn the Neolithic Age makes the point admirably: 


There are nine and sixty way of constructing tribal lays 
And — every — single — one — of — them — is — right! 


It is much the same with the Cauchy Theorems: all of the different versions are 
essentially the same. 

At the heart of all Cauchy-type theorems is the local existence of an antiderivative, 
Section 8.5. The theorems themselves supply, as hypotheses, conditions that permit this 
local result to be globalised in some way. 
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For example, in Chapter 8 the local result led to the central Cauchy Theorem 8.8, that 
if f is differentiable in D and a closed contour does not wind round any point outside D, 
then / 4 f = 0. This ‘non-winding’ condition ensures that the local pieces of antideriva- 
tive fit together well on the global level. The generalised version, with several contours 
V1, +++»Yy; 1S a simple corollary obtained by making cuts between the contours. 

In this chapter we studied what happens when a contour is deformed continuously. 
Again the local existence of an antiderivative is involved: it lets us define the integral of 
f along an arbitrary path, and it gives the main result of that chapter, that the integral of 
f along a boundary is zero. This is also a globalisation: associated with any boundary is 
a map of an entire rectangle, and the local antiderivatives all fit together properly across 
the image of this rectangle. Moreover, we saw that a path is a boundary if and only if it 
is homotopic to zero. Since homotopy leaves the integral round a closed path invariant, 
this makes the reason why yJ = 0 transparent. 

We therefore have two main variants of the Cauchy Theorem: one for paths that do 
not wind round points outside D, and another for paths that are homotopic to zero. But 
(more Kipling), like the Colonel’s Lady and Judy O’Grady, they are ‘sisters under their 
skins’. Obviously if y is homotopic to zero and zo ¢ D, then 


1 1 
WY 20) = 7 dz=0 
211 y 2-20 


because 1/(z — zo) is differentiable in D. So the ‘non-winding’ version of Cauchy’s 
Theorem easily implies the ‘homotopy’ version. 

It is in fact strictly stronger, in the following sense. The path in Figure 9.20 does not 
wind round any zy ¢ D, but it is manifestly not homotopic to zero. (However, this is 
harder to prove than it might appear.) So the ‘non-winding’ hypothesis is weaker, hence 


applies to more cases. 

Even this surface difference vanishes when we look a little more deeply, however. 
Every path that does not wind round any z) ¢ D can be transformed, using a series 
of cuts whose contributions to the integral cancel, into a closed path (or set of paths) 
homotopic to zero. For example, Figure 9.20 (left) is so transformed in Figure 9.20 
(right). This fact is also non-trivial to prove, but it shows that the practical consequences 


SSD SF 


D D 


Figure 9.20 Left: A path that satisfies the non-winding condition but is not homotopic to zero. 
Right: Creating cuts in the path makes it homotopic to zero. 
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of the extra generality are largely spurious. (The theoretical consequences are more 
important: the ‘non-winding’ condition is part of ‘homology theory’, and what we have 
here is a relation between homotopy and homology. We postpone a homology version 


of Cauchy’s Theorem to Chapter 16, so that we can move on to more practical issues in 
Chapters 10-14.) 


Exercises 


1. LetD = {2 € C:z 4 Of and S.A = re" (t € [0,27]) for r = 1,2. Define a 
continuous map @ : R + D, where R is a rectangle, such that 


he ee 


for any f differentiable in D. Use Theorem 9.6 to deduce that 


ee 


Describe a homotopy via closed paths in D from S$, to Sp. 


2. Let 71,2 be closed paths in a domain D that are homotopic via closed paths in 
D. By making a suitable cut o from y; to y2, describe a fixed end point homo- 
topy in D from 4, to o + y2 — o. Draw a picture to illustrate the continuous 
deformation. 


3. Let the boundary 0¢@ of a continuous map ¢ : R — D be subdivided into two 
subpaths 0@ = y, + y2. Describe a fixed end point homotopy in D from y; to —y. 
Draw a picture to illustrate the continuous deformation. 


4. Draw the semicircle y(t) = (tre [—2/2,2/2]) in D = C \ {0}. Define two 
explicit polygonal paths 41, 42 from —i to iin D such that 


[r-fo 


is true for all f differentiable in D when 4 = A, but false when A = Ap. 
5. Let y : [0,1] — C be given by y(0) = y(1) = 0 and 


t+ it sin(z/t) for t € [0, 4] 


y= (1-) +i —dsina/(1—9) forte [1,1] 


Show that y is a closed path but not a contour, and draw a sketch. 


6. Integrate the following functions around the path y in Exercise 5: 
(i) cos? (z*) 
(oe) 


Gi) So z"/n 
n=1 
Gi) 1/(@— 4/2) 


7. 


10. 
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Let f be differentiable in D. For a closed path y in D, beginning and ending at zo, 
the integral value I, is the complex number 


w= fs 


Show that the set of integral values forms a commutative group /(f, D) under the 
operation 


Ly +15 = y+6 


Determine the group of integral values in the following cases: 
(i) f(z) = 1/z,D = C \ {0} 

(ii) f(z) = cosz,D = C \ {0} 

(iii) f(z) = 2/(z-—1) + 3/(z +1), D=C \ {+1} 

Gv) f@ = /@-)4+ 2/@+1),D=C\ {+1} 


. The Fundamental Group. Let D be a domain and zo € D. For closed paths y,6 in D 


that begin and end at zo, define y ~ 6 to mean that y, 6 are fixed end point homo- 
topic in D. Show that ~ is an equivalence relation. Let [y ] denote the equivalence 
class containing y, and let 2(D, zo) be the set of equivalence classes. 

Define the operation * on 2 (D, zo) by 


[vy] * [5] = [y +4] 


Check that « is well-defined, and show that 2 (D, zo) is a group under , specifying 
the identity element and the inverse of [yj]. 
For any other point z} € D and any path o in D from Zo to z, define g : 
m(D,z0) — 2(D, z1) by 
sly) =[-o+y +09] 
Show that g is an isomorphism of groups and deduce that m(D, z) is independent 
of the choice of z € D. (This result relies on D being path-connected.) For this 


reason, the group 2 (D, zo) is usually denoted by z(D) and called the fundamental 
group of D. 


. Describe (without formal proof, since we have not developed suitable techniques) 


the fundamental groups of the following domains: 

(i) C 

di) {ze C: |z| < 1} 
(iii) {2 € C: |1 < z| < 2} 
(iv) C \ {0} 

(v) C\ {+} 
(vi) C\Z 
Let f be differentiable in the domain D, and let y, 6 be closed contours in D begin- 
ning and ending at the same point zo. Show that if y ~ 6 in the sense of Exercise 8, 
then the integral values J, and J are equal. Prove that the map h : 7(D) — I(f, D) 
for which h([y]) = J, is a well-defined group homomorphism. Describe the 
homomorphism /h for each f in Exercise 7. 
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11. Integrals Along Arbitrary Paths. For fixed z,, z € D, suppose that yo is a fixed path 
in D, and y is a variable path in D, both from z, to z2. Show that 


[r= [rt 


for some o € I(f,D). Prove that if y is deformed continuously in a homotopy via 
closed paths in D, then J, remains constant. 

For z] = —i,z2 = i, determine all possible values of i for each f,D in 
Exercise 7. 


10 


Taylor Series 


We now reach a stage in the theory were we can take a great leap forward and show, 
as promised repeatedly, that any differentiable complex function has a local power 
series representation. On the slim assumption that the derivative of f exists throughout 
a domain D, we find that near any point zg € D there is a power series expansion 


[o.) 
fla +h) = anh" for 29 +h € N,(z0) 
n=0 


which is valid — that is, converges — on any disc N;(zo) C D, see Figure 10.1. 


Figure 10.1 The power series round zg converges on any disc in D. 


This theorem releases a tidal wave of theorems, because we can apply the results of 
Chapter 4 on power series to any differentiable function f. For instance, Corollary 4.21 
states that a power series can be differentiated term by terms as many times as we like. 
Taylor’s Theorem, Corollary 4.22, gives an explicit formula: 


f(z) 
fZ+th= >» — for 2 +heéeN,(z) CD 
n=0 : 


Thus if we insist only that the first derivative f’ exists in D, it follows that all higher 
derivatives exist, and the function is equal to its Taylor series in N;(zo). We derive subtler 
consequences in this and subsequent chapters, and Cauchy’s Theorem plays a prominent 
role. It is this sequence of results that gives complex analysis its own special flavour. 
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Figure 10.2 Contours in the proof of the Cauchy Integral Formula. 


Cauchy Integral Formula 


The proof that any differentiable function can be expressed as a power series depends 
on a result of Cauchy, itself of intrinsic interest: 


LEMMA 10.1 (Cauchy Integral Formula for a Circle). Let f be differentiable in the disc 
Nr(zo) = {2 € C: |z—zo| < R}. For0 <r <R, let C, be the path C(t) = z+ re (t € 
[0, 27r]). Then for |w — zo| < r we have 
1 f(2) 
fw) = 


271 Jc,Z—W 


dz 


Proof. Fix w such that |w — z| < r. The function F(z) = (f(z) — f(w))/( — w) is 
differentiable in the domain 


D={2E€C:|z—-z| <R, zw} 
Let 0 < e€ < r—|w — z|. Then the circle S;, centre w, radius e, is 
S-(t) =z tee" (t € [0,27]) 


and lies in D, as do all points inside C; and outside S,, see Figure 10.2. 
By the Generalised Cauchy Theorem, Theorem 8.9, 


[ roa= [ F(z)dz (10.1) 


Now lim,_,,, F(z = f’(w), so for some 5 > 0, M > 0, we have 
0 < |z—w| <6 implies |F(z)| < M 


The Estimation Lemma, Lemma 6.41, implies that for ¢ < 4, 


| i F(z)dz 
Se 


<M.2m7e 
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From (10.1), 
| F(z)dz| < 2Mre 
C; 
Since ¢ is arbitrary, this implies that 
J F(z)dz = 0 
Therefore 
ie w 
as fa. 
Cc zZ-W Cc, Z—W 
1 
=f(w) | ——dz 
CZ-WwW 
= f(w) - 271 
In other words, 
1 2) 
r= [ Ma 


2mi Jc, Z—W 


Taylor Series 
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Using the Cauchy Integral Formula we can now expand f(z +/) as a power series in h, 


with coefficients expressed as integrals. 


LEMMA 10.2. Let f be differentiable in Nr(zo).Then there exist an, € C such that 


feo +h) = So anh! 
n=0 


where the series converges absolutely for |h| < R. Further, if 


C(t) = z + re" (t € [0,27]) 


1 fo 
271 ij (z — zo)ttl = 


then 


Proof. Fix h with 0 < |h| < R and initially suppose that r satisfies |h| < r < R, 


Figure 10.3. 
The Cauchy Integral Formula gives 


1 
feo +h = 7 F@) 


271 Cc <7 (zo +h) 
h h” 
~ Oni [PO pe "@—zr | @—zoyrr#l 
pymt+i 
"(z= zo)"*H(z = zo — h) a 


m 
= >. ayh"™ + Am 
n=0 
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Figure 10.3 Contour for proof of Lemma 10.2. 


where 


1 f() 
” =| eat 


r 


1 | f( z) jyin+ 1 


A =, 
"Oni Ic, (@ — zo" *1(z — zo — h) 


We demonstrate that limm—+oo Am = 0. 
First, since f is differentiable it is continuous, so @(f) = | f(C;(H)| is a continuous real 
function on [0, 27]. From real analysis, @ is bounded, so 


|d(t)| < M_ forzonC, 
Now |h| < r,|z—zo| =r, and 
|Z— Zo —Al = |lz— zol — |All =r — [Al 
Therefore the Estimation Lemma gives 


4) <2 Mn er — MAL (1MLy” 
= 
™~ Qn rt l(r — |h]) r—|hl 


r 


Since we chose |h| < r, this tends to zero as m tends to infinity. Thus 


jim. ( te Sou =0 


n=0 


which means that 
CO 
feo +h) = Y0 anh" 
n=0 


This expansion is valid for |h| < R, and 


i f(z) 
2ni [ (z —zo)"t! . 
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for |h| < r. The latter restriction can now be seen to be unnecessary, for the integral is 
differentiable for 0 < |z—zo| < R, so the integral is unchanged if r is varied in the range 
O<r<R. 


Once we know that a power series expansion exists, we can use our knowledge of 
power series to deduce: 


THEOREM 10.3 (Taylor Series). If f is differentiable in a domain D, then all higher 
derivatives of f exist throughout D. In any disc Nr(Z) © D the Taylor series expansion 


20 g(r) 
fo +h) = een (10.2) 


n=0 


is valid. Further, if 0 < r < Rand C,(t) = z+ re" (t € [0,2z]), then 
! 
f(z) — / f@) dz 


oni zZ- zoyttl 


Proof. From Lemma 10.2, 


CO 
fo +h) = Y anh" for |h| <R 
n=0 


In other words, putting z= zo + A, 


[o,@) 
f= ds an(z— 20)" for iz — zo] < R 
n=0 


Now, a power series may be differentiated term by term as often as we please, and by 


Corollary 4.22 
) = _ nt f(z 
fe) =H a @— zy 


This gives the desired integral expression for f(™(zo). Substituting a, = (f(zo))/n! in 


the power series gives the Taylor expansion. 


Theorem 10.3 was first proved by Cauchy in 1831 by the method above. The series is 
named after Brooke Taylor, who in 1715 was the first to publish the idea that a function 
can be expanded as a power series of the form 


FO yf 


n! 


fath=>> 
n=0 


Taylor’s theory was restricted to real functions, and it will come as no surprise that the 
idea was previously known to others; specifically to James Gregory, who was aware of it 
some 45 years earlier, and to Newton in 1691. During the eighteenth century there were 
various attempts to base the theory of real analysis on power series, the most famous 
being that of Joseph-Louis Lagrange in 1797. Cauchy used power series extensively in 
complex analysis. It is a curious quirk of fate that in 1829 he quoted the counterexample 
f@) =e * to show that not every infinitely differentiable real function is equal to 
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its Taylor series. Just two years later he went on to show that all differentiable complex 
functions have valid power series expansions. 
We now introduce some standard terminology, which until now we have avoided. 


DEFINITION 10.4. A real function f : D — R, where D C R, or a complex function 
f :D— C, where D C C, is analytic if for eacha € D it has a power series expansion 


flath)=>— anh" 
N=0 


valid in some neighbourhood of a. 


Cauchy demonstrated that in the real case there are functions that are infinitely dif- 
ferentiable but not analytic. But in the complex case he proved that any function that 
is differentiable once in a domain must be analytic. At a stroke he showed that in this 
respect complex analysis is simpler than real analysis, and reduced the general study of 
differentiable complex functions to computations with power series, giving the sequence 
of theorems that unfolds in the next few sections. 


REMARK 10.5. A complex function on a domain is differentiable if and only if it is ana- 
lytic. The two words just emphasise different points of view — existence of a derivative, 
existence of a power series expansion — and may be used interchangeably. 


Morera’s Theorem 


First we have a partial converse to Cauchy’s Theorem, due to Giacinto Morera in 1889: 


THEOREM 10.6 (Morera’s Theorem). Jf f : D — C is continuous in a domain D and 
lof = 0 for all closed contours y in D, then f is differentiable in D. 


Proof. By Theorem 6.44, if f , f = 0 for all closed contours y in D, then there exists a 
differentiable function F in D whose derivative is f. But now we know that F is twice 
(indeed infinitely) differentiable, so f = F’ is differentiable. 


This theorem explains our warning in Chapter 6 that it is futile to try to find an 
antiderivative F for a non-differentiable complex function f. It cannot have one. So there 
is a class of functions, including f(z) = |z|, that are continuous but not differentiable. 
Such functions are integrable, via the formula 


b 
/ f= / Fly @)y"(ae 
y a 


but the Fundamental Theorem of Contour Integration is no use whatsoever in this con- 
text, because f has no antiderivative — in stark contrast to the real case, where every 
continuous function has an antiderivative. (It can be argued that here real analysis is 
simpler than complex.) 

To summarise what we know about the existence of derivatives and antiderivatives 
for complex functions: 
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If f is differentiable in a domain D, then all higher derivatives of f exist. The func- 
tion f has an antiderivative only when f is differentiable, and even then, only local 
antiderivatives are guaranteed. Specifically, if D1 GC Dis simply connected, then f has 
an antiderivative in D;, Theorem 8.12. In particular, we can guarantee the existence of 
an antiderivative of a differentiable function in any disc contained in its domain. 


Cauchy’s Estimate 


Theorem 10.3 includes a generalisation of Cauchy’s Integral Formula to the higher 
derivatives of a differentiable function: 


f(z) = n! / f@) 


2ni Jc, (Z@—z)"t! 


where Nr(zo) € Dand 0 < r < R. With the standard conventions that f(z) = f(z) and 
0! = 1, this formula is true for all integers n > 0. Using it, we can give an upper bound 
for | fo): 


LEMMA 10.7 (Cauchy’s Estimate). If f : D — C is differentiable for z—z < R, 
0<r<R,and|f(2| <M for |z— a| <7, then 


Mn! 
M,.\| < 
LPs 


Proof. 
(ny, — | @ / f@) 
Leo = [55 |. Goa 
! 
Qn rt 
Mn! 


Cauchy’s Estimate yields an important theorem of Joseph Liouville, which has an 
unexpected application to a purely algebraic problem: 


THEOREM 10.8 (Liouville’s Theorem). If f is differentiable and bounded in the whole 
complex plane, then f is constant. 


Proof. Suppose that |f(z)| < M for all z € C. Cauchy’s Estimate applied to the 
derivative gives 


M 
If@l < — 
;: 


Since f is differentiable on C we may let r > oo, making M/r as small as we please. 
Since | f’(z)| is independent of r, we have 


Ffm=0 


Thus f’(z) = 0 throughout C, and (integrating) f is constant. 


The application is: 


214 


10.5 


Taylor Series 


THEOREM 10.9 (Fundamental Theorem of Algebra). Let P(z) = z’ +ayz""!+---+ay, 
be a polynomial, where n > 1 and a,...,a, € C. Then there exists w € C with 
Pw) = 0. 


Proof. For a contradiction, suppose that P(z) # 0 for all z € C. Then 1/P(z) is 
differentiable throughout C. 
Suppose z 4 0. Then 


P(Z) aj Gn 
=14 fees > las |z| > oo 
ge Zz a 
So there exists k > 0 such that 
P 
@) . 1 for |z| > k 
ie 
Thus 
1 2 
< < for |z| > k 
P(Z) \z"| ~~ k" 


The same bound works for 1/P(z) throughout C. For if z < k we take a circle Cr centre 
zo, radius R, which is so large that |z| > k for all z on Cr. Then 


1 2 
PO < a for all z on Cr 
and Cauchy’s Estimate gives 
1 2 
a ee: 
P(z)| ~~ k® 


By Liouville’s Theorem, 1/P(z) is constant, so P(z) is constant. But this contradicts 
n > 1. Therefore P(w) = 0 for some w € C. 


It follows in the usual way that any polynomial P(z) of degree n, with complex 
coefficients, can be expressed as a produce of terms of degree 1: 


P(z) = (z — a1)(z — a2) ++ -(Z — On) 


where the aj € C. 


Zeros 
We now broaden our perspective from polynomials, and look at the zeros of arbitrary 
differentiable functions. 


DEFINITION 10.10. A zero of a differentiable function f : D > C is a point zo € D 
for which f(z) = 0. 


Expanding f in a Taylor series about the zero zo, we have 


f=) anlz—z0)" for |z—zol < R 


n=0 
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where Nr(zo) C D. Then ag = f (zp) = 0, and two distinctly different things can occur: 
either all of the other a,, are zero, in which case f(z) = 0 for all z € Np(zo), or there 
exists m > 1 such that 


a =a, =-::=4y,_; = 0, but a,, 4 0 (10.3) 


DEFINITION 10.11. If (10.3) holds, then zo is a zero of order m, or a zero of finite order 
if m is unspecified. 


The notion of a zero of order m relates directly to that of an infinitesimal of order m 
in Chapter 15. 

Formula (10.2) for the Taylor coefficients implies that a zero of order m can be 
characterised by the condition 


f(2o) =f (zo) = --- =f" -Y(zo) = 0, but f™(zo) # 0 


Another useful expression for such a zero is to write 
fo) = (@ — z0)"8@) (lz —zol < R) 


where 
CO 
82) = > amin — 2)" 
n=0 
is differentiable for |z — zo| < Rand g(zo) = adm £0. 


This leads to a fundamentally important idea: 


DEFINITION 10.12. A zero zo of a differentiable function f : D — C is isolated if 
some disc centred upon it contains no other zeros of f; that is, there exists 6 > 0 such 
that 


0 < |z— %| < 6 implies f(z) 4 0 


LEMMA 10.13. A zero of finite order is isolated. 


Proof. Write f(z) = (z — 2o)"g(z) for |z — zp| < R, where g is differentiable and 
2(zo) 4 0. Then g is continuous at zo, so, taking ¢ = 5 g(zo)| there exists 6 > 0 such 
that 


|Z — Z| < 6 implies |g(z) — g(Zo)| < € 


Therefore, when |z — zo| < 6 we have 


Ig(z)l = Ilg@o)| — lg@o) — g@)|| > 2e -e =e 


In particular, g(z) 4 0. But if 0 < |z — z| < 6 then |z — zo|” 4 0, so f(z) = (z — zo)” 
g(z) £0. 


COROLLARY 10.14. Let S be a set of zeros of a differentiable function f in D, having 
a limit point zo € D. Then f is identically zero in any disc Nr(zo) © D. 
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Proof. Because zo is a limit point of S, there is a sequence {z,},>1 in S that tends to zg. 
Then f(zo) = limn+oof (Zn) = 0. So zo is a zero of f that is not isolated. Hence it does 
not have finite order, so 


f(zo +h) = y. Anhy 
n=0 


in any disc Nr(zZo) € D, where all ay are zero. 


From this we deduce: 


PROPOSITION 10.15. If f is differentiable in a domain D and S is a set of zeros of f 
with a limit point zo € D, then f is identically zero on D. 


Proof. Corollary 10.14 gives f(z) = 0 for z in any disc Nr(zo) € D. 
For any other z € D, choose a path y : [a,b] — D from Zo to z, Figure 10.4. We show 
that f(y ()) = 0 for all t € [a,b]. By continuity, we can find 6 > O such that 


a<t<a+ 5d implies y(t) € Nr(zo) 


so f(y(t)) = 0 fora <t <a-+4. Lets be the least upper bound of those x € [a, b] such 
that f(vy(x)) = 0 fora<t<x.Thena+65<s<b. 

By continuity, f(y(s)) = 0. If s < b then y(s) is a non-isolated zero, so f is identically 
zero in a neighbourhood of y(s). Then there exists an interval [s,s + «] with k > s, on 
which f(y(t)) = 0, contradicting the definition of s. Hence s = b, so f(z) = f(y(b)) = 
0. 


An important consequence is: 


THEOREM 10.16 (Identity Theorem). If f and g are differentiable in a domain D and 
f@ = g(z) for all z ¢ S C Dwhere S has a limit point in D, then f = g throughout D. 


Proof. Apply Proposition 10.15 to f — g. 


Figure 10.4 Proof of Proposition 10.15. 
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Example 10.17. It is essential for the relevant limit point of S to be in D: if not, the 
theorem can be false. Let 

f(2 = sind /z) 

g(z) =0 


in D = C \ {0}. Then f(z) = g(z) for z = +1/(nz) and z = 0 is a limit point of 
S = {+1/(n7)}, but f A ginD. 


Extension Functions 
Combining the Identity Theorem with power series leads to a method that can often 
extend the domain of definition of a complex differentiable function. 


DEFINITION 10.18. A function f : D — C is an extension function of h: S > C if 
S C D and f(z) = A(z) for all z € S. (We also say that f extends h.) 


Example 10.19. f(z) = 1/(1 — z) onD = C \ {1} is an extension function of h(z) = 
Lies ws = eC: [z= 1; 


Suppose that D is a domain and S C D has a limit point in D. The Identity Theorem 
shows that if a function f : S — Chas a differentiable extension function g : D C C, 
then the extension is unique. 

As an application, suppose that f : D > C is differentiable and consider the Taylor 
expansion f(z) = )-p2.9 an(z — zo)" of f in a disc N;(zo) C D. Then f is the only 
possible extension of the Taylor expansion to the whole of D. This means that although 
the Taylor expansion may not converge on the whole of D, it contains all the information 
needed to determine f uniquely throughout D. We use this fact to great advantage in 
Chapter 14. 


Example 10.20. Consider the power series 


2 


rae 


on the small disc S = {z € C: |z| < 1/1000 000}. We can certainly extend this function 
to a larger disc, because the radius of convergence of the power series is 1. However, 
we can extend beyond this. For instance, if K = {tf e R: ¢t < —l} andD=C\K, 
then f(z) = log(1 + 2) is the (unique) differentiable extension of the power series to D, 
Figure 10.5. 
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S is atiny disc radius 10 


Figure 10.5 Extending the power series (10.4) to C \ K. 


The set S need not be a domain. In particular, it may be a subset of R: 


Example 10.21. f(z) = sinz for z € C is the unique differentiable extension function of 
{( = sinx for x € R. This is a consequence of the usual convergent Taylor expansion 


in the real case. 


Of course, if D is a domain then it is open, so DM R is open in R. A differentiable 
function f : D + C has a power series expansion in a neighbourhood of any x9 € DOR. 
So if a real function h : S — R extends to a differentiable complex function on a 
domain, it must already have a power series expansion about any point in S C DOR. 
Thus the only real functions that extend to differentiable complex functions are real 
analytic functions. 

We may take S to be even more restricted, provided that it has at least one limit point 
in D: 


Example 10.22. If f(1/n) = 1/n? for all positive integers n, then f(z) = z* is the unique 
analytic extension of f to the whole complex plane, because 


S = {1/n: nis a positive integer} 


has the limit point 0 € C. 


The Taylor expansion is valid on discs (Theorem 10.3), and may therefore be used to 
define extension functions. It is therefore easy to see that the radius of convergence of 
the Taylor expansion of a function f about a point zp is equal to the distance from zg to 
the nearest point z, at which no differentiable extension function of f may be defined. 
Such a point is called a singularity of f, see Chapter 11. Singularities are obstacles that 
determine the radius of convergence of a Taylor series. 

(There is no relation between this use of ‘singularity’ and the term ‘singu- 
lar’ in Definition 6.20. ‘Singular’ is just one of those words that get overused in 
mathematics.) 
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Local Maxima and Minima 


The complex numbers are not ordered, so we cannot speak of maxima and minima of a 
complex function f. We can, however, consider maximum and minimum values of the 
modulus | f|, since this is real. 


DEFINITION 10.23. Iff : D — C, then 


(i) The modulus |f| has a local maximum at zo € D if there exists ¢ > 0 such that 
N,(2o) © D and | f(z)| < |f(Z)| for all z € N,(zp). The local maximum is strict if 
If(Z)| < |f@o)| for all z € Ne (Zo) \ {zo}. 

(ii) The modulus | f| has a local minimum at zo € D if there exists ¢ > 0 such that 
Ne(zo) © D and |f(z)| = |f(zo)| for all z € Ne(zo). The local minimum is strict if 
If(Z)| > |f(@o)| for all z € N, (Zo) \ {zo}- 


The problem of finding local maxima of |f| in a domain turns out to be easy (or 
impossible, depending on how you look at it, since there are none.) 


PROPOSITION 10.24. A differentiable function has no strict local maximum of its 
modulus in its domain. If it has a local maximum in its domain, it is constant. 


Proof. Suppose that f is differentiable in D and let zo be a local maximum in D. Then 
Lf(@| < |f(Zo)| for all z € Ne (zo). For0 < r < e, the circle C;(f) = zotre” (t € [0,27]) 
lies inside N-(zo), so | f(zo + re")| < |f(zo)| for all t € [0,27]. The Cauchy Integral 
formula gives 


1 
f@o) = 5 PO a: 
1 G et 
=i ei oe rev). cit 
~ Beri ret 
Lf f(zo + rel’)dt 
~ Ori 
so that 
1 20 : 1 20 
If(zo)| < =| | f(zo + re")|dt < | |f(zo)\dt < |f(zo)| 
wT Jo 20 0 
Hence 


1 Qn . 
l= 5 f Lflco+ rela (10.5) 
Tw Jo 


If the strict inequality | f(z + re'’)| < |f(zo)| were to hold for any t € [0, 27], then by 
continuity it would also hold in a small interval, giving the strict inequality 


1 20 : 
— | Lf@o-+ reldt < | f()| 
aA) 


contradicting (10.5). 
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Hence | f(zo + re'’)| = |f(zo)| for all t € [0,277]. This holds for any r < ¢, so|f| is 
constant in N;(zo). By Proposition 4.15, f is constant in N-(z). The Identity Theorem 
now shows that f is constant throughout D. 


For the complementary notion of a local minimum of |f|, it is clear that if f is non- 
constant and has a zero at zg then Z is isolated and |f| has a strict local minimum 
there. If f is non-zero on D, observe that |f| has a strict local minimum if and only if 
|1/f| = 1/|f| has a strict local maximum. We can apply Proposition 10.24 to 1/f to get: 


PROPOSITION 10.25. A differentiable function that does not equal zero anywhere in 
its domain has no strict local minimum of its modulus in its domain. If it has a local 


minimum in its domain, it is constant. 


The Maximum Modulus Theorem 


The question of maxima or minima of |f| on an arbitrary subset of the domain of f is 
somewhat different. 


DEFINITION 10.26. Let f : D — C be differentiable in a domain D, and let S C D. 
Then | f| has alocal maximum on S at zo € S if 


(i) Zo is a limit point of S. 
(ii) For some ¢ > 0, | f(z)| < |f(Zo)| whenever z € N,(Z0) NS. 


Condition (1) is essential, for otherwise some neighbourhood N;(zo) contains no 
points of S other than zo, and then condition (ii) is vacuously true. 


Example 10.27. If f(z) = e* and S = {z € C: |z| < 1}, then | f(x + iy)| = e* and | f| has 
a local maximum on S at the point zo = 1. 


Using Proposition 10.24, we see that if Ne(zo) C S, then | f| cannot have a strict local 
maximum at zg. To explore this further, we need: 


DEFINITION 10.28. A point zp is a boundary point of S if every neighbourhood of zp 

contains a point in S and a point not in S, other than 2g itself. In other words, a boundary 

point is a limit point both of S and its complement C \ S. 
The boundary dS of S is its set of boundary points. 


Example 10.29. The boundary of S = {z € C: |z| < 1} and of of its complement 
{z€C: |z| > l}is the circle {z € C: |z| = 1}. 


We can now rephrase Proposition 10.24 to give: 
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THEOREM 10.30 (Maximum Modulus Theorem). [f a differentiable function is not 
constant, then any local maximum value of its modulus on an arbitrary subset of its 


domain occurs on the boundary of that set. 


We also have from Proposition 10.25: 


THEOREM 10.31 (Minimum Modulus Theorem). Jfa differentiable function is not con- 
stant, then any local minimum value of its modulus on an arbitrary subset of its domain 
occurs either at a zero of the function, or on the boundary of that set. 


Example 10.32. If f(z) = z” on the set § = {z € C: |z| < 1}, then the maximum values 
of | f(x +iy)| = x? + y? occur all round the boundary of S, while the minimum value 


occurs at the origin. 


Exercises 


1. Find the Taylor series at 0 of f(z) = Log(1 + z), where Log is the principal 
value. What is the disc of convergence? Answer the same questions for g(z) = 
exp(a Log(1 + z)), where a € C. 


2. Find the first three terms and radius of convergence for the Taylor series at 0 of 
F(z) =[1+Log( —gJ-!. 
3. Taylor expand the following functions around 0, and find the radius of convergence. 
(i) sin? z 
(i) 2(@+2) 
(iii) (az + b)~! (a,b €C,b £0) 
(iv) fo exp(w*)dw 
. fe (sinz)/z (z #0) 
(z= 0) 
(vi) fo (sinw)/w dw 


4. Define the numbers c, by the Taylor series 


secz = Ye Yo gn 


Prove that 


co=l 


2n 2n 2n 
O=coter,| , Jel 4 Jr teml 5, 


Show that c2, 1s always an integer and calculate it for n <5. 


5. Let 
(-z2-2) 1 =) Fz" 
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10. 


11. 


12. 


Prove that 
Fo =F, =1 Fy =Fy-1+Fn-2 (n> 2) 


This is the recursive definition of the Fibonacci numbers 1, 1,2,3,5,8, 13,.... 
By expanding (1 — z— z*)~! in partial fractions, prove that 


2 1 1 
1 ie i n+ 1— Js n+ 
F,=— 
J5 2 2 
Investigate analogous results to Exercise 5 on the expansion of (1 — az — bz 
fora,b €C. 


2-1 


If 
ra 
=e (=) = Dan" 
prove that 
1f/n-1 
a = — 
a am =>-- ( =) 
S: 

Let f(z) have Taylor series )*a,z" for |z| < R. Let w = e?71/3 and define 


g(z) = x(f(z) + f(wz) + f(w*z)) 
Show that 
g@) =) aanz” 


for |z| < R. Find similar expressions for )* a3n412°"+! and )* a3n422°"*?. (Hint: 
l+o+a =0.) 
Define three functions by 


gn zont2 zgont 
ae Ge P= x, Gam Yer Dd Gn+ 1! 
Prove the series converge for all z. Using Exercise 8, prove the following: 
(i) a’(z) = B) PR=yv@ y'@)=az) 
(ii) a(z) = z[e% + 2e~2/? cos(zV3/2)]; find similar expressions for A(z), 7 (z). 
(iii) a(Z + w) = ae(z)o(w) + B(Z)y(w) + y@)BW) 
(iv) A@)+P@+v*@=1 


(v) a(z) = Biz) <=> z= Gn—1)- eZ) 


Generalise Exercise 8 to find an expression for )* Apna” (p = 2,3, ...), and (harder) 
for ¥ aig (pS 2,3 ectk =O long p=) 

Show that the Cauchy estimate is an equality if and only if f(z) = Kz” for some 
KeC,n=1,2,3,.... 

Let D be a disc centre zo, and let f be differentiable in a domain containing D. 
Prove that the ‘mean value’ of f(z) as z runs over 0D (defined by a suitable integral) 
is equal to f (zo). 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 
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Let f be differentiable throughout C, and suppose that |f(z)| < K|z|° for a real 
constant K and positive integer c. Prove that f is a polynomial function of degree 
<C. 


Let f and g be differentiable on the strip D = {z € C : —2 < imz < 2}. Suppose 
that f(z) = g(z) for all z such that |z| < 0.01. By considering Taylor expansions first 
about 0, then 1, and so on by induction, prove that f(z) = g(z) on D. 


Suppose that f(z) = }° an(z — zo)" in a disc D centre zo, radius R. If0 <r < R, 
show that 


1 2 
i0)\249 — 2..2n 
| | f(zo + re) ?'dO =) ~ \anl?r 
(Parseval’s Inequality). Hence show that 


Yo lan PP" < M(r) = sup | f(Zo + re!®)| 
7) 


Use this to give an alternative proof of the Maximum Modulus Theorem. 


If p(z) is a polynomial of degree n, show that for each R > 0 the ‘level curve’ of 
[p(z)|, defined to be {z € C: |p(z)| = R}, has at most n connected components. 


Suppose that x?-++ y” < a’. Prove that (1 +x)? + y? attains its maximum value when 
x = a,y = O. (Hint: apply the Maximum Modulus Theorem to 1 + z on the disc 
Iz| = a.) 

Suppose that ce y < 1. Prove that (x7 — y* eb aes 4x75? attains its maximum 
value when x = 0,y = +1. 


If f is differentiable in a domain D, prove that the zeros of f are either all of finite 
order and isolated, or f is identically zero on D. 


Let 
(oe) 
f(x) = / pea 
x 
where x is real and positive. By repeated integration by parts, show that if 
hy(x) = (—1)" nlx YD 
then 


FOR) = ig) + fy) 4 + Bg) + (CL / erat 


x 


Show that the series 
CO 
So hina) (10.6) 


diverges for all x. Show also that 


o- Shs < Nix Wt) 
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so that for large enough x the series (10.6) provides an arbitrarily good approxima- 
tion to f(x), even though it diverges. 

(The catch is that, the better the approximation required, the larger x has to be. 
No particular choice of x gives an arbitrarily good approximation if we take N large 
enough. A series with this property is called an asymptotic series.) 
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The Taylor series expansion is too limited for many applications. A useful generalisation 
was given by Laurent in 1843. Weierstrass may have discovered it two years earlier, but 
his paper was not published until after his death. Laurent considered “power series’ 
involving negative powers as well as positive. The benefits that accrue are hinted at by 
the following example. The function f(z) = el? is very badly behaved as regards 
Taylor series expansion. We have seen that, restricted to the real line, its Taylor series 
about the origin is 0 + Ox-+ Ox? +---, which does not converge to f(x). On the complex 
plane it is, if such a statement makes sense, even less capable of being represented by 
a Taylor series. The natural series representation is obtained by starting with the Taylor 


series for e* and replacing z by — 1/2’, giving 


1 1 
fQ=lag* tot + oro +. 


and this is a series of the ‘negative powers’ type. It converges for all z for which —1/z? 
is defined, namely z 4 0. 


Series Involving Negative Powers 


The general series of this type can be written in the form 


[o,2) 


Yo an(z — 20)" 


n=—0OO 


which is to be thought of as a compact notation for 


(>: an(Z — 2") cs (>: a_n(Z— a) 
n=0 n=l 


and hence converges if and only if the two series in parentheses converge. We know that 
power series converge inside a disc. Consequently power series with negative powers 
alone should converge outside a disc. For instance, that for e~!/ 2 converges outside the 
disc |z| = 0. Therefore those with both positive and negative powers should converge in 
a region between two concentric circles. Recall that such a region is called an annulus. 
More precisely, if Rj, Ry € RwithO < Ry < Ro < o&, and zy € C, then 


{2 €C:R <|z —zol < Ro} 
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is an annulus. 


We begin with an existence theorem for series expansions of the above kind. 


THEOREM 11.1 (Laurent’s Theorem). /f f is differentiable in the annulus R, < |z — 


zo| < R2 where 0 < Ri < R2 < &, then 


Ce Ce 
feo +h) = Yo anh" + > bah 
n=0 n=1 


where )~a,h" converges for |h| < Rz, >\b,h~”" converges for |h| > Ry, and in 


particular both sides converge in the interior of the annulus. 
Further, if C,(t) = zo + re“, (t € [0,27] then 


sf ot 
"Oi C, (z—zoy't! 


b, = | f (zz — 2)” dz 
271 Jc, 


Before giving the proof: 


REMARK 11.2. In the more compact notation we set Cn = dn (n > 0) and c_y = by 


(n > 1). Then 


[o,@) 


fleo+h) = D> enh" 


n=—0OO 


which converges in the interior of the annulus, and 
foe econ 
" Ini to, (@—z0)"t! 


Proof of Theorem 11.1. If Ry < |h| < Ry, choose r,, 7 such that 


for all n € Z. 


Ri <1 <|h|<1r<R 
and let 


Cy) =zot ne” (t € [0,27]) 
C,,) =z + me" (t € [0,27]) 


as in Figure 11.1. 
We show first that 


1 f(2) 1 i f(@) 
th) = d d 
EO a I, G@—G+th) aide, @—Go+M) - 


To do so, enclose zo + / ina small circle 


S.(t) =z + ee" (t € [0,27]) 


Then the function 
f@) 


RA 
aoe ere 


(11.1) 


11.1 Series Involving Negative Powers 


Figure 11.1. An annulus, showing paths used to prove Theorem 11.1. 


is differentiable in 
S={zeC:R <|z—-zl <R.zFwWH+h} 


and the contours —C,,, C,,, Se satisfy the hypotheses of Theorem 8.9. Therefore 


[. Fonte + f Fede+ f F(zdz = 0 
Cry r 


1 Se 


Hence 


i F(z)dz = F(z)dz — i F(z)dz 
Se Cry C 


rl 


But by Cauchy’s Integral Formula, 
[ Fed = 2i-fa +9 
Se 


giving (11.1). 
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(Alternatively we can make cuts and integrate round the two halves, as shown in 


Figure 11.2. The parts of the contours along the cuts cancel in pairs when integrated.) 


All we need do now is work out the two integrals in (11.1) as power series, and 


calculate the coefficients. Unfortunately this is the longer part of the proof, although the 


calculations are routine. 
First, we choose (1, 02 with 


r<pi <|h|<pr2<1re2 
which enforces the conditions: 


(i) |1z— (Zo +h)| > r2 — 2 for z on C,,; and 
(ii) |z—(zo + A)| > p1 — 11 for z on C,,. 
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Figure 11.2 Alternative proof using cuts. 


As in the proof of the Taylor series expansion, Lemma 10.2, we get 


CO 
= | FQ) dz =) a,h" 
Q7i Ce Gor h) =a 


ne: / f@) ae 
"Oni aa (z— zo)?! 


The treatment of the second integral is similar, but will be given in full. Since 


for |h| < 2, where 


1 z-m, , &-z)"" g—zy =i 
hi ho! hn h'(z—zy—h)  z—2y—h 


(summing a geometric series) we have 


1 f@ 2 1, ,@-a)y"  @=2)" 
271 | ean" x3 |. Fo F nn na l* 


2 


n 
= Do bmh™” — Bn 
m=1 


where 
1 
bn = == / F(2)(z — 20)" "dz 
271 on 


1 f(2(z — Zo)" 
2mi Jc, A(z — zo — h) 


By = 


Finally we estimate the size of B,. There exists M > 0 such that | f(z)| < M on C,, 
since this circle is closed and bounded. By (1) above, |z — zo —h| > o1 — 1. Also 
|h| > o1,|z — zo] = r1. Hence 


n n 
B,| < as oe nr, = Mn (2) 
2m py (p1 — r1) pi-n \pi 
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which tends to 0 as b= n— on, since r;/p1 < 1. It follows that 


1 f@ 
b,h-™ 
Qi i. (Z—Z — a -¥ 


To finish the proof we must replace C,, and C,, by C;, in the expressions for a, and 


by. Since all three paths are homotopic inside the annulus, or by using cuts, this is 
immediate. 


Note that we can no longer assert that a, = f(zo)/n! since f(z) need not be 
differentiable for |z — zo| < Ri under the stated hypotheses. 


DEFINITION 11.3. The Laurent series or Laurent expansion of f(z) about zo is the 


series 
CO 


> Chh" 


n=—0OO 


where h = z — zo and cy is defined above. 


THEOREM 11.4. The Laurent expansion of f about zo is unique. 


Proof. Suppose that 


{QO= > dn(z — zo)" 


n=—0OO 


Then 


dn 
20)" f() = 8 dn(z— zo)" "| =fi@t zp TR 


n=—0oO 


where 


AO= >, an)" 


[o,2) 


Ap@= >> de-a)"" 


n=m+1 


But each of f,,/> has an antiderivative in Ry < |z— Z| < Ro, because we can integrate 
term by term. That is, set 


F\(z) = 3 _ a= ay 
Re at ‘ 
i= > aeoa 
n 
n=m+1 a 


It is easy to check that the series converge absolutely for Rj < |z — z| < Rp» and that 
Fi (2) = fiz), F5(z) = f2(z). Hence 
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d 
F(z — 20)" "dz = / midz =27idm 
Cr Cy & — £0 
SO dm = Cm as defined above. 
Example 11.5. Let f(z) = e% + e!/<. We know that 
=~ 1 
e& = > a for all z 
n=0 
= t 
1/z _ a 
e Pe for all z #0 
n=0 
so 
CO 
f@= >» Cn 2" 
mMm>=CO 
where 
Cm = 1/m! (m > 1) 
co = 2 


Cm =1/(—m)! (m < -1) 


and the expansion is valid for z 4 0. 


Example 11.6. f(z) = : + — In a similar way, f(z) = }> ¢mz" where 
Zz —Z 


Cm =O (m< —1) 


Cm =1 (m> -—l1) 


1 1 
Example 11.7. f(z) = ——- — ——.. Writing this as 
gL. -Z—2 


1 1 
zd—1/z)  2(1 —2z/2) 


we obtain an expansion f(z) = }° cz" where 


Cm =1 (m<-l) 
Cn = 2°"tY) (m= 0) 


valid in the annulus 1 < |z| < 2. 


Isolated Singularities 


If f is differentiable in a punctured disc 


0 < |z—z|<R whereR >0O 
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we Say that zg is an isolated singularity of f. We can use the Laurent expansion to study 
such singularities. There is a Laurent series 


f) = Dane — 20)" + D> bale — 20)” 
n=0 n=1 


valid for 0 < |z— zo| < R. This series can behave in three radically different ways: 
(1) All by = 0. If we define f(zo) = ao we obtain a function that is differentiable on 
the whole disc |z — zo| < R, with Taylor series 


[e,2) 
Yan — z0)" 
n=0 


In this case Zg is said to be aremovable singularity. It arises from our choice of a domain 
of definition of f, rather than from any intrinsic feature of f. 
For example, consider 


sin Z 


f@= ae (z £0) 


Around zo = 0 we have 
2 4 


x 
bi aay T 51 


so by defining f(0) = 1 we get a function differentiable for all z € D. 
(2) Only finitely many b, 4 0. Then 


bn | | b | — _ n 
a (z— zo)" "Z= 20. aaa ae 


where b», 4 0. In this case we say that f has a pole of order m at Zo. 
For example 


f@=z tsinz (z #0) 
1 1 0° . gant 
3B  3lz | a ” (2n + 5)! 


has a pole of order 3 at zo = 0. 
(2) Infinitely many b, ~ 0. Then we say that zo is an isolated essential singularity. 
For example 


f@ =sin(l/z) (2 #0) 
at. 44 4 
z 3I3 5125 


has an isolated essential singularity at 7 = 0. 
We investigate the behaviour of a differentiable function near an isolated singularity, 
according to these three possibilities. 
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Behaviour Near an Isolated Singularity 


Removable singularities are trivial and uninteresting — which make it all the more 
important to recognise them. The next lemma is usually sufficient for this purpose. 


LEMMA 11.8. The following are equivalent for a function f that is differentiable for 
Q < |g —29| < K? 


(i) 2 is a removable singularity of f. 
(ii) lim,_,., f(z) exists and is finite. 
(iii) There exist M > 0,6 > 0 such that | f(2| < M for 0 < |z— zo| <6. 


Proof. Trivially (i) = Gi) => (iii), so we need prove only that (iii) > (i). 
Suppose (iii) holds, and take a Laurent expansion 


CO [o,@) 
4@= > m@—wy + Dba — 20) 
n=0 n=1 
Now 
1 n—1 
bi = =~ | f@K—2)" dz 
211 Jc, 
where C,(t) = zo + re“ (t € [0,27]) and0 <r <R. So 
1 
lbn| < —Mr'""! . 2a r = Mr’ 
20 


if we let r + 0 it follows that |b,| = 0 for alln > 1. Thus zg is a removable singularity 
and (i) holds. 


We immediately deduce the useful: 


COROLLARY 11.9. If any coefficient b, 4 0 (n => 1) then f is unbounded on every 
open disc with centre zo. 


For example, if 
2 


z 
I@= a) ae (<< #0) 


lim f(z) = lim ( —* é 
z30 z>0 Lee — 1 sin Z 


é Zz = Zz ~ 
lim (1+ = +---) (-F+-) 


= 1 


then 


so Zo = 0 is a removable singularity. 
There is a similar, slightly more complicated, criterion for poles. 
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PROPOSITION 11.10. [ff is differentiable for 0 < |z — Z| < R, then f has a pole of 
order m at zo if and only if 


lim — 20) "f(—) =140 


Proof. If f has a pole of order m then 


(z— zo) f(z) = bm +++ +b1(z— zo)" 1 + by an(z— zo)" 
n=0 


so that 
lim@ — £0)" F(Z) = dm #0 
LZ 


Conversely, if the limit is ] A 0, then g(z) = (z — zo)/"f(z) has a removable singularity 
at zo by Lemma 11.8, so there is a series 


g(2) = ) > an(z — 20)" 
n=0 


valid for |z — z| < R, andap = 14 0. But now 


f@= ia Ld Y | antm(z— 20)" 
n=0 


(z — zo)” Z—20 


where ao 4 0, so f has a pole of order m at zo. 


For example, consider 


Ae ae, 
(2) = ——.—, OK kl < 1) 
f (1 — z)3 sin’ z 
Then lim,_,9 z*f(z) = 3 4 0, so there is a double pole (pole of order 2) at the origin. 
Also, lim,_s 1(z — 1)3f(z) = —8/sin’(1) ¥ 0, so there is a triple pole (pole of order 3) 
at zo = 1. 


COROLLARY 11.11. The function f(z) has a pole of order m at zo if and only if 1/f(z) 
has a removable singularity at zy which, if removed, gives rise to a zero of order m at 


zo. This in particular occurs when 1/f (z) has a zero of order m at z. 


Proof. If f(z has a pole of order m at zo then g(z) = (z — zo)”f(z) has a removable 
singularity at zo. Further, these exists 6 > 0 such that g(z) # 0 for |z — zo| < 6. So 


1/f(z) = (@ — zo)” /3(z) 


and 1/g(z) is differentiable for |z — zo| < 5. Therefore, with the singularity removed, 
1/f(z) has a zero of order m at zo. 


The converse is proved by an almost identical argument. 


COROLLARY 11.12. If f(z) has a pole of order m at zo, then 


lim | f(z)| = +oo 
Z>% 
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Proof. 
Jim If(I = Jim Ig(z)/(z — x)" | 


= |J|- lim |z—zol™ 
2H 


=+0o 


Thus near a pole the behaviour of f is really quite good. However, near an isolated 
essential singularity the behaviour is much wilder. The following result is classical: 


THEOREM 11.13 (Weierstrass—Casorati Theorem). In every neighbourhood of an iso- 
lated essential singularity zo a differentiable function f takes values arbitrarily close to 
any assigned complex number. Specifically, given r > 0,€ > 0, and w € C there exists 
z, such that |z, — Z| < Rand | f(z) —w| <e. 


Proof. We have 
Co ee) 
f) = Yo ane — 20)" + D5 baz — 20)” 
n=0 n=1 


with infinitely many b, 4 0. If we define ¢(z) = f(z) — w then the Laurent series of 
(z) differs from that of f(z) only in the coefficient ag, which becomes dy — w. Hence 
(z) also has an isolated essential singularity at zo, so we need prove only that #(z) can 
be made arbitrarily small in any neighbourhood of zo. That is, there exists z; with 


0 < |z1 —zo| <r |P(z1)| < € 


If zp is a limit of zeros of @ this is trivial. Failing this, there exists p > O such that 
o(z) # 0 for 0 < |z — z| < p. Either there exists z,; with O < |z; — zo| < r and 
1/0(z1) > 1/e, or else 1/@(z) is bounded for 0 < |z; — zo| < r. If the former, then 
the theorem is proved. If the latter, then 1/(z) has a removable singularity at zo by 
Lemma 11.8, so by Corollary 11.11 (z) has at worst a pole at z, which contradicts zp 


being essential. Thus the latter case cannot occur. 


In point of fact, a stronger and more satisfying result is true. Emile Picard (1856- 
1941) proved that in every neighbourhood of an isolated essential singularity, a 
differentiable function f takes every value, with at most one exception. For instance, 
sin(1/z) takes every value in 0 < |z| < r for any r > 0, as can easily be verified. The 
exception can occur: e!/% misses out the value 0 but attains all others in 0 < |z| <r 
for any r > O. But the proof of Picard’s Theorem requires machinery (elliptic modular 
functions) considerably beyond the reach of this text. 


The Extended Complex Plane, or Riemann Sphere 


In real analysis it is standard to extend the real line R by adjoining two points at infin- 
ity, +-oo and —oo. We then have a simple way to discuss and visualise limits such as 
limy-+ +00 1 /x and lim,_, —o9 e*. 
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Sometimes it is more helpful to consider these two points to be identical. For example, 
the graph of y = 1/x is a hyperbola, which heads off to too along both axes. If we 
identify +-co with —oo (known as the one-point-compactification of R) the extended 
hyperbola closes up to form a closed curve, like an ellipse. Both of these curves are conic 
sections, and in projective geometry they are projectively equivalent. So the projective 
line extends R by adjoining a single point at infinity. 

There is a similar extension of the complex plane, which lets us describe the behaviour 
of a complex function ‘at infinity’ by adjoining to C a single extra point ‘oo’. We adjoin 
just one point in the complex case because the distinction between the two ends of the 
real line becomes blurred when we deal with the whole plane. For instance, if we rotate 
C continuously through z, +-oo and —oo in R swap places. The method for adjoining 
this extra point was introduced by Bernhard Riemann, and has an elegant geometric 
realisation. 

Think of C as being embedded in the (x, y)-plane in R°, so that a point x + iy € C is 
identified with (x, y,0) € R>. Let 


VP={En OER 2847 +e7=)1} 


be the unit sphere. A line joining the North pole (0, 0, 1) to (x,y, 0) cuts S* in a unique 
point (€,7,¢), giving a one-to-one correspondence between C and all points on S? 
except the North pole, Figure 11.3. 

It is easy to verify that 


3 cab om 
(€,n,¢) corresponds to (+) (77) 


As (&,7, €) gets near to (0,0, 1) it follows (as is obvious geometrically) that |x + iy| 
becomes very large. Thus it is reasonable to introduce the symbol 


[© @) 


Figure 11.3 The Riemann sphere. 
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f 


Figure 11.4 Neighbourhood of oo on the Riemann sphere. 


to correspond to (0,0, 1) € S*. Thus we have a one-to-one correspondence between S” 
and C U {oo}. We call the latter the extended complex plane. It may be identified with 
S*, which is then known as the Riemann sphere. 

We can now think of C U {oo} either as a plane plus an extra point, or as a sphere. 
Correspondingly, we think of C as a plane, or as a sphere without a North pole. Both 
viewpoints are valuable, depending on the problem at hand. 

Since {x + iy : |x + iy| > R} corresponds to a ‘spherical cap’ between a line of 
latitude and the North pole (Figure 11.4) it makes sense to think of {z € C: |z| > R} as 
a ‘neighbourhood of oo’. Such neighbourhoods get smaller as R gets bigger. So doing 
this leads to a concept of continuity on S*, agreeing with geometric intuition (and the 
standard distance function on R*). Readers familiar with topology can render this in 
more precise terms: we obtain a topology on the Riemann sphere that is identical to its 
usual topology as a subset of R?. 


Behaviour of a Differentiable Function at Infinity 


Suppose that f(z) is differentiable in {z € C : |z| > R}. Then we can define 


sy =fU/z) O< lzl < 1/R) 


Since g/(z) = —z~?f'(1/z) it follows that g(z) is differentiable for 0 < |z| < 1/R, and 
therefore has an isolated singularity at 0. 


DEFINITION 11.14. The function f has a removable singularity, pole of order m, or 
isolated essential singularity at oo if and only if g(z) = f(1/z) has the corresponding 
type of singularity at 0. 
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Example 11.15. f(z) = 1/z ({z| > 0). Then g(z) = z (|z| > 0), sof has a removable 
singularity at oo. 


Example 11.16. f(z) = z. Then g(z) = 1/z((z| > 0), so f has a simple pole at oo. 


Example 11.17. f(z) = e%. Then g(z) = e!/< (\z| > 0) and f has an isolated essential 
singularity at oo. 


Example 11.18. f(z) = 1/(sinz) (¢ #Anz,n € Z). Then g(z) = 1/ sin(1/z). We cannot 
say that f has an isolated essential singularity at oo, because f is not differentiable in 
{z€C: |z| > R} for any R > 0. However, f certainly has some sort of singularity at oo. 
In general, if f(z) has a sequence of isolated singularities z, such that z, — oo, we say 
that f has an essential (but not isolated) singularity at oo. 


If f has a removable singularity at oo then g has a removable singularity at 0, so 
may be rendered differentiable in {z € C : |z| < 1/R}. Thus we may say that f is 
differentiable (or analytic) at oo, and define 


f(co) = g(0) = lim g@) 
and now 
flo) = lim fle) 


For instance, if f(z) = 1/z then g(0) = 0 sof(oo) = 0. 
Further, we say that f has a zero of order m at oo if g has a zero of order m at 0. Thus 
if f(z) = 1/z” (|z| > 0,m € Z,m > 0) then g(z) = z” and f has a zero of order m at oo. 
In general, any statement about the behaviour of f at oo can be translated into one 
about g at 0 — and this is how to define or prove such a statement. 


Meromorphic Functions 
Poles are not, as singularities go, particularly nasty, and it is of interest to consider a 
class of functions more general than differentiable ones. 


DEFINITION 11.19. If f is differentiable everywhere in a domain D except for points 
at which f has a pole, then f is meromorphic in D. 


For instance, f(z) = 1/z is differentiable in C \ {0}, and has a pole at 0, so it is 
meromorphic in C. Similarly, f(z) = 1/(sinz) is differentiable in C \ {nz : n € Z}, and 
has poles at nz, so it is meromorphic in C. 

We might go further, and consider functions meromorphic in the extended complex 
plane — by which we mean that the only singularities of f, including oo, if necessary are 
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poles. It is not necessary to introduce a special term for such functions, because they 
turn out to have a simple description. Recall that a rational function is one of the form 


(2 = O()/W) 


where ¢, ¥ are polynomial functions. Then we have: 


PROPOSITION 11.20. A function is meromorphic in the extended complex plane if and 
only if it is rational. 


Proof. Clearly a rational function is meromorphic in C U {oo}. 
Suppose conversely that f is meromorphic in C U {oo}. Since f is differentiable at co 
or has a pole at oo, there exists R > O such that f is differentiable in 


{zEC: |z| > R} 
So the poles, other than oo, occur inside the closed disc 
{zeEC: |z| < R} 


Since poles are isolated singularities, each pole has a neighbourhood that contains no 
other poles. Indeed, if there were an infinite set S of poles inside |z| < R, there would 
be a limit point of S. This is not possible because the poles are isolated. This closed 
disc contains only finitely many poles, say z1,..., z¢. Let the orders of these poles be 
n1,...,Nk. Define 


az) = @— 21)" +++ — a) 


This is differentiable throughout C, hence analytic, so it has a Taylor series 


e(z) = yas (z€C) 


n=0 
Now the polynomial 


(z) = (z— 21)" +--+ (z@—- HY 


has a pole of order nj +--+ + nx at 00, since 


— 
—n 
XN |e 
NS” 
II 
—s 
[ee 
| 
2 
Nee 
= 
, 
| = 
| 
NX 
~ 
Ne 
= 


= Zaye big + 


Since f has at worst a pole of order N at oo for some N it follows that g(z) has a pole of 
order 


M=n+---+m+N 


at oo. Now 


g(1/z)= Yoanz" ([z| > 0) 
n=0 
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and since this has a pole of order M at 0 we must have a, = 0 forn > M. Therefore 


g(z) = an +--+» + ame" 


which is a polynomial, so f(z) = g(z)/w(z), which is rational. 


A function meromorphic only in C need not be rational, as the example 1/(sin z) 
shows. This is not rational since it has infinitely many poles, but we have already seen 
that it is meromorphic in C. So the behaviour at infinity is important. 
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1. Find the Laurent expansions of the following functions of z around z = 0: 
G@ @=3ay* 
(ii) @-—a)-* @e Ck =1,2,3,...) 
Gui) 1/(z(1 — z)) 
(iv) 1/(z—a)@ — b)) (a,b, € C) 
(v) Ze! /z 
(vi) exp(z + 1/z) 
(vii) cos(1/z) 
(viii) exp(z~>) 
In each case, specify the largest annulus in which the expansion is valid. 
2. Find the Laurent expansions of the following functions on the stated annulus: 
@) (¢ —1)-%(z — 2)! on « |z| <1 
ii) (z — 1)72(z —2)7! on 1 < |z| <2 
ii) ( — 1)-2(z — 2)—! on2 < |z| <3 
(iv) exp(—z~*) on |z| > 0 
(v) (1 — z—2z*)~! in powers of z— 1 on 0 < |z— 1| < 1 
(vi) e®/(1 +27) on [z| > 1 
3. Which of the following functions have a Laurent expansion around the given point 
zo (that is, in powers of z — zo)? (For multivalued functions, choose one particular 
definition on a domain that makes it single-valued.) 
(i) ./z about zo = 1 
(ii) ./Z about zy) = 0 
(iii) Log z about z = 0 
(iv) Log z about zp = 3 
(v) 1+ /Z about zo = 0 
(vi) tan7!(1 + z) about zo = 0 
(vii) sin~!(z) about zo = 0 
(viii) V (2/2) — sin~!(z) about zo = 1 
(ix) z’cosec(1/z) about zo = 0 


(x) J @x/4) — sin“ (2) about zo = V2 
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4. Prove the validity of the Laurent expansion 


1 CO CO 
@-DE-D & a 
on a suitable annulus, and state which annulus. 
5. Find Laurent series for (z* — 1)~! and (z2 + 1)~! in powers of z+ i and z — i, and 
say in which annuli these are valid. 


6. Let a,b € C. Show that 


CO 
exp(az + bz~!) = y az” 
—0O 
where 
1 


2n 
Un = al e+) 00s? cosi(q — b) sind — nO dO 
20 0 


7. Let a € C. Show that 


sin(a(z + z~!)) = Ss Anz" 


where 
1 2n 
an = = / sin(2a cos 8) cos nO d@ 
20 (0) 


8. Find the poles of the functions: 
@ I/(@ +1) 
Gi) 1/@* + 16) 
(iii) 1/(z* + 22? + 1) 
(iv) 1/@+z-1) 
9. Describe the type of singularity at 0 of each of the following functions: 
Gi) sin(1/z) @ £0) 
(ii) z~3 sin’ z 
(ili) zcotz (¢ Anz,n € Z) 
(iv) cosec?z— 2-7 (¢ £nn,n€ Z) 
(v) (cosz — L)/z 
(vi) (sinz —z +.2°/6)/z! 
10. Let D be a disc centre zg, let f be differentiable on D except at zo, and suppose that 
| f(z)| is bounded on D \ {zo}. Show that zg is a removable singularity of f. (Hint: 
negative Laurent terms are 0 — why?) 


11. Find all singularities of the following functions, and say which are poles: 


@ @tz'y! 
ii) COS TZ 
a 1 — 42? 


(iii) exp(z +z!) 
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12. Construct a function defined on C U {00} having only the following singularities: 
(i) A pole of order 2 at oo. 
Gi) A simple pole at each of the points ertikip k=0,... ,p — 1, p an odd integer 
> 2. 
(iii) A simple pole at z = 2 and a pole of order 5 at z= ./2. 
13. Let p(z), g(z) be polynomials of degrees m,n respectively. Describe the behaviour at 
infinity of: 
@) p@) + 4) 
(i) p@)q@) 
(ii) p@)/q@) 
14. Find all singularities, and the behaviour at infinity, of the functions: 
ese )\- 
Gi) 2/2-2/ 
(iii) (e@ — 17! - 1/z 
(iv) cot 1/z 
(v) (cosz)z~ 
(vi) ((cos z) — 1)z~? 
(vii) cot(1/z) — 1/z 
(viii) sin(1/ cos(1/z)) 
15. Let IP be a circle in the complex plane, or a straight line. Is its image on the Riemann 
sphere also a circle? 
16. Find the poles and zeros of tan z. Show that tanz is meromorphic in C, but is not a 
rational function. 
17. Show that (z + 1 + z~')~! has a removable singularity at z = 0. Find its Taylor 
expansion, and the radius of convergence of this. 


2 


18. Verify Picard’s Theorem directly for the functions: 
(i) e& 
(ii) tan? z 
(iii) z 
(iv) sinz 
(v) el/z 
(vi) cos z 
(vii) tanz 
(viii) A function of your own choice. 

19. Let f(z) have a pole of ordern at z = a. Define the principal part $(z) of f(z) to be 
the sum of the negative-power terms in the Laurent expansion of f(z) in powers of 
z—a. Prove that f(z) — #(z) is differentiable at z = a. 

20. Show that in a neighbourhood of a pole, a complex function is the sum of a rational 
function and a differentiable one. 


21. Suppose that f is differentiable on C except at poles, and that oo is either a pole or 
a removable singularity of f. Show that: 
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(i) f has only finitely many poles. 
(ii) f — }>,pr is constant, where the poles of f are at points a; and the p, are the 
corresponding principal parts of f (defined in Exercise 19). 
(iii) f is a rational function, and )°, p; is its ‘partial fraction’ decomposition. 
22. Let f : C > C be differentiable, with f(z) # 0 for all z € C. Suppose that 
lim,_, 600 f(z) exists and is non-zero. Prove that f is constant. 
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12.1 


Residues 


Among the many applications of complex analysis are the explicit computation of def- 
inite integrals, the summation of series, and counting how many zeros a function has in 
a given region of C. Although such questions are not as important a part of pure math- 
ematics as they once were, they are still very relevant to practical applications. Further, 
the power of the method and its wide applicability demonstrate the advantage of general 
principles and deep theorems over manipulative ingenuity. 

The basic idea in all of these applications is to use Cauchy’s Theorem to exploit the 
exceptional nature of the term b,/(z — Zo) in the Laurent expansion of a differentiable 
function. 


Cauchy’s Residue Theorem 


The residue of f at zo is determined by the special coefficient b, just described: 
DEFINITION 12.1. If f has an isolated singularity at z) and a Laurent expansion 


fco+h) = do anh" + Y* bah” (O< lhl < R) 


n=0 n=1 


then the residue of f at zo is 


res(f,z0) = by 


From Theorem 11.1 we immediately deduce that 


1 
res (f,Z0) = aq | foe (12.1) 


where C,;(t) = zo +el! (t € [0,27 ]) and 0 <r < R. This shows the relevance of residues 
to integration. 


DEFINITION 12.2. A closed path y is a simple loop if every point z not on y has 


Wy, z) = Oorw(y,z) = 1. 
As usual, the set of points z € C satisfying w(y, z) 4 0 (which in this case means 


w(y, Z) = 1) are said to be inside y. 


In applications, simple loops will all be made up of straight line segments and parts of 
circles, as in Figure 12.1. All the simple loops encountered will be Jordan contours (that 


244 


Residues 


Figure 12.1 Simple loops in C. 


Figure 12.2 A simple loop in C that is not a Jordan contour. 


is, they do not self-intersect), but that is not essential for the theory. (Figure 12.2 shows 
a simple loop according to our definition that is not a Jordan contour.) What matters is 
that the points inside y all have w(y, z) = 1. dn particular, y winds once anticlockwise 
around such points.) 

In such cases, we have: 


THEOREM 12.3 (Cauchy’s Residue Theorem). Let D be a domain containing a simple 
loop y and the points inside y. If f is differentiable in D except for finitely many isolated 
singularities at Z1,...,Zy, inside y, then 


i, f(gdz = 2i J tes (f,2z)) 
Y r=l1 


Proof. Since D is open we can find circles S,(t) = z + €,el! (t € [0,27]) round the z, 
such that S, and points inside it lie inside D, and such that S; contains no singularity of 
f other than z,, Figure 12.3. Then the collection of paths 


—y,S1,. os Sn 
satisfies the hypotheses of Theorem 8.9, because 
w(-y,z) = W(S;,z)=0 (z ¢ D) 


w(—y,2Zr) = —w(y,zZr)=-—1_ since z; is inside y 
WSysZ-) = OifmsAr, 1 ifm=r 


12.1 Cauchy’s Residue Theorem 


D 


Figure 12.3 Small circular contours around each isolated singularity. 


Therefore 


for all z Z D. 
By the Generalised Cauchy Theorem, 


[rt [pte [ f=0 
eee, 


= mi: res(f, 21) + +++ +2mi- res(f, Zn) 


so that 


by (12.1). 
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Alternative Proof. Instead of the Generalised Cauchy Theorem we can use Laurent 


series: the proof is instructive but less elegant. Around each z, there is a Laurent 


expansion of f, which we split into three parts: 
res (f, z;) 
f(2) = Q(z) + aw aT Pr 


where 


Qr(z) = > bn(z = Zr)” 


P,(2) = Do ane — zr)" 


(12.2) 


Now P,(z) is differentiable in a neighbourhood of z,, and Q,(z) + res (f,z,)/(z — Z,) 1S 


differentiable for z 4 z, since all but a finite number of the by, are zero. Therefore 


hi) =f(@) — J 0,@) - y SL) 


r=1 j=l 


(12.3) 
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(12.4) 


is differentiable in D, except perhaps at z1,... , Z,. But from (12.2) itis also differentiable 
in a neighbourhood of each z,. Hence h(z) is differentiable in D, so by Cauchy’s Theorem 
/ h(z)dz = 0 
y 

But Q,(z) = T;(z) where 

Ob 

T@=) = qe-w 
n=2 


so that 
i Q,(z)dz = 0 
Y 


as well. Integrating round y and using (12.4) and (12.5), we get 


0= f fede -0- Prestfizy f ay Mae 
Y r=1 Y 


= [ fede — ani) res(f,) 
yy. r=! 


Calculating Residues 


(12.5) 


The Residue Theorem can be used to calculate integrals — and not just round simple 
loops. For it to be of much use, we must find ways to calculate residues. The following 


two lemmas are very useful in this respect. 


LEMMA 12.4. (i) If Z is a simple pole of f then 


res (f,z0) = Jim (Z — zo)f (z) 


(ii) If f(z) = p(z)/q(z) where p(zo) # 9, g(z0) = 0, q’'(zo) 4 0, then 


res (f, 20) = p(zo)/q (zo) 


Proof. We have already done part (i) in the previous chapter as Proposition 11.10, but 


to recap: we have 


= by ~ = n 
fQ= 2 + d an(z — z0) 


— Z0 


and so 


(— zo) = bi + > an — z0)"* 


n=0 
which tends to bj as z > zo. 
For (ii), note that 
lim (z= z0)P@ _ mae (2 — q(Zo) 
Z>Z0 q(z) Z>2 Z—% 


since g(zo) = 0, and this is equal to p(z)/q'(zo). 


) 
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For example, if 


COS 7 Z 
f(Q = p16 


then 
cOsz 1 
976 - 1975 976 


LEMMA 12.5. If zo is a pole of f of order m then 


res(f,1) = 


1 qv-! - 
res (f, Zo) = lim ((Z — 2) (2)| 


z>z0 | (m — 1)! dzm-1 


Proof. We have 


bmn | | by \ — _ n 
f@) G— a" T t a T Dante Z0) 
so that 
(Z— m)"£(@) = bm +++ + dE — WY" + Yo age — zo)" 
n=0 
Therefore 
ee a (mtn)! ‘ 
ganar (& — 20)"f(@) = (m— Diby 4 d pyre 2) a 


Now take limits. 


For example, consider 


£47\° 
{@O= (55) 


which has a triple pole at zo = 1. Then 
@-)F@O=e+P 
and so 
1@¢ 
2! de? 
which tends to 3 -2 = 6 as z— 1. Sores(f, 1) = 6. 
On occasion another technique may be brought into play: working out the appropri- 


(@ — 1)3f(2) = oat 1) 


ate part of the Laurent series. (It is a waste to work out the whole thing, because the 
whole point about residues is that we do not need the whole thing, but only b;.) For 
instance: 


II 
— 
~ 
N 
i) 
N 
| 
| “bo 
+ 
NE 
NE 
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so that res(f,0) = 1/6. 


12.3 Evaluation of Definite Integrals 


We now consider a number of techniques for calculating various kinds of definite 
integral. 


2n 
I: f O(cos t, sint)dt 
0 


Let C(t) = e (t € [0, 27]) be the unit circle. If 
z=Ci) =e" 


then 


from which we get 


QO(cos t, sin t)dt = / Q Z4 >a IZ : 
0 C 2 zJ 21 Zz 1Z 


= 271d 


where » is the sum of the residues of 


1 1 1 1 1 126 
Lo(s(ert)-A@-) es 


inside C. 
For example, consider 
20 
, (cos? t + sin? t)dt 
0 
Then (12.6) becomes 
L fig. 1\. 1 1" 
iz \8 - z ae" z 
Meg -t .o - e D Lge 


as T T 


8 di | Bi ODiz | iS | Bid 
which has a pole inside C with residue 1/21. So the integral is 271/21 = 7. 
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If Q is at all complicated, the computations can become very tedious. Sometimes 
integrals of this kind can be found from the real and imaginary parts of an integral 


/ g(z)dz 
Cc 


with a suitable choice of g. For instance, 


z 
i © de = Oi 
Cz 


since e*/z has residue 1 at z = 0. Therefore 


20 eC0s t+isint 
—_——dt= 271 
0 elt 


so 
20 _ 
/ eos t+isint gy =2n 
0 


and 


20 
i eS" Tcos(sint)) + isin(sint)]dt = 27 
0 


Equating real and imaginary parts, 
Qn 
/ eS‘ cos(sint) = 2a 
0 


20 
j eS! sin(sin t) = 0 
0 


Il: f - fax 


The real integral 


CO 
/ f (x)dx (12.7) 
—CO 
is defined to be 
x2 
lim / f(x)dx (12.8) 
X1—> —00, X2-+00 Jy, 


provided the limit exists. The techniques we are about to discuss allow the calculation 
of 


R 
lim / f(x)dx (12.9) 
R>oo —R 


which is known as the Cauchy principal value of the integral, denoted by 


P / - fx)dx 
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If (12.8) exists then so does (12.9) and the two are equal. But the Cauchy principal value 
may exist when (12.8) does not. For example 


R 
if xdx = [72s =0 
=R 


R 
pf xdx = 0 
—R 


But clearly (12.8) does not exist when f(x) = x. 
It follows that when we use this technique below, we must take into account the 


so that 


convergence of (12.8). This, in part, leads to condition (ii) of the following: 
PROPOSITION 12.6. Suppose that: 


(i) A function f is differentiable in the upper half-plane {z € C : imz > 0} except for a 
finite number of poles, none of which lies on the real axis. 
(ii) If Sp(t) = Re" (t € [0, 27]) then there is a constant A such that for large R 


’@)| < A/R? 
when z lies on Sp. 
Then 
[oe] 
/ f(x) dx = 271 
—CO 


where & is the sum of the residues of the poles of f in the upper half-plane. 


Proof. Choose R large enough for (ii) to hold, and so that all the poles lie inside 
Sr + [-R, R], Figure 12.4. 
By Cauchy’s Theorem, 


R 
/ f(x) de + / f(z)dz = 277i 
—R SR 
with & as stated. Now let R > oo. Then 
TA 


| | ee 
| ie! < Poa R 


R R 


Figure 12.4 Choice of R in proof of Proposition 12.6. 
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which tends to 0 as R > oo. Hence 
R 
lim fx) dx = 271d 
R->0oo _R 
That is, 


Pf foyax=2ni 


[o,2) 


Now, (ii) tells us that [f(x)| < A/x? for large |x|, so it follows from real analysis that 


i: Fe) dx 


exists. It is therefore equal to its Cauchy principal value, hence to 2771 X. 


REMARK 12.7. Condition (ii) is certainly satisfied if f(z) = p(z)/q(z) where p and q are 
polynomials such that g has no real zeros and the degree of qg is greater than or equal to 


the degree of p plus 2. 


For example, consider 


as dx 
I. Ge te ib) 

where a,b > 0 are real, a € b. By Remark 12.7 this satisfies (ii), and (i) is obviously 
true. The only poles of 1/((z* + a”)(z” + b”)) in the upper half-plane are simple poles at 
ia, ib. The residue at ia is 

i zZ—ia 1 

im = 
zoia (22 +.a*)\(z2 +b?) 2ia(b? — a?) 


and similarly that at ib is 
1 
2ib(a2 — b?) 


Therefore the value of the integral is 


2mi : ! : -_- 
2ia(b2 — a2) 2ib(a2 — b?) ab(a + b) 


REMARK 12.8. In the proof of Proposition 12.6 we do not require f(z) to be real for z 
on the real axis. 


For instance, let 
elz 
(2? + a?\(2? + B*) 


fW= 


Then on Sp we have |e“| = |e +"| = e <1 for y > 0, so (i) holds. As before, there 
are simple poles at ia, ib, but now the residues are 


e * e? 


Sa, RA 
Jia —@) ible? — bY 
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Hence 


oo ex foe et e2 
[. @+Q202+R) (a —@) ° b@ - ») 


so, equating real and imaginary parts, 


[- cos x de XT a ad 
oo (x? + a*)(x2 + 5?) «= P-@\ a b 


ye sinx deeb 
oo +A +R) 


Of these, the second equality is actually obvious (why?), but the first is not. 

We can try to generalise this method in (at least) two ways: by getting a better estimate 
for f Spf or by allowing f to have poles on the real axis. The first is dealt with in (III), 
the second in (IV). 


Il: f . f (x)e* dx 


PROPOSITION 12.9. Suppose that: 


(i) A function f is differentiable in a domain containing the upper half-plane {z € C : 
im z > 0} except for a finite number of poles, none on the real axis. 
(ii) There is a constant A such that for large R 


If(z)| < A/R_ for |zZ|] =R 
when z lies on Sr. Then 


J fove dx = 27id 


where & is the sum of the residues of the poles of f(z)e™ in the upper half-plane. 


Proof. It is possible to use the same contour as in (II) and prove that 


Roo 


lim i f(oe% dz = 0 
SR 


but this calculates only the Cauchy principal value. Moreover, the convergence problem 
in this case is much harder because all we know is that f behaves like 1/|x| for large |x|, 
which on its own does not imply the existence of 


/ * f (xje*dx 


More delicate arguments can overcome this obstacle, but it is easier to sidestep the whole 
question by using a different contour, as in Figure 12.5. 
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Figure 12.5 Contour for the proof of Proposition 12.9. 


We prove that as X1,X2, Y > oo, each of 


f(oe® dz 
T; 


for r = 2, 3,4 tends to zero. It then follows as before that 


X1,X2—> 00 


Xp 
lim / f(xe"dx = 271 ¥ 
=X; 


as required. 
Now we carry out the necessary estimates, 


| : | | r¥ : | 
if f(aeX ae = if fX+ it)e™2— idr 
T2 0 


fae 
< —e ‘dt 
0 X2 

A 

X2 


= 


for X2 large. Similarly 


| / f(@e del < — 
Ts ~ Xy 
Also 


| : | | X2 . | 
| / f(2e” dz = - / f+ iyjet" a 
3 —X, 


Xo A 
, —el dt 
-X Y 


= AY—!e-¥(X, + Xp) 


< 


For fixed X1,X2 this tends to 0 as Y — ov. Now let X),X2 — ov. 


As an example, take 


* xen d bees 
iz Giese Or ee 
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which does not satisfy the conditions for (II), but does for (II). The integrand has simple 
poles in the upper half-plane at ia and ib. Calculating the residues in the usual way, 
applying Proposition 12.9, and equating imaginary parts yields 


ca x? sinx T 
dx = b2e—? 2e-a 
im @+Aet Raw! — 


(Equating real parts proves that the corresponding cosine integral is zero, but again this 
is obvious on other grounds.) 


IV: Poles on the real axis. 

If f(z) has poles on the real axis we make ‘indentations’ in the contour by drawing small 

semicircles as in Figure 12.6. Suppose these semicircles have radii €;, €2,.... Then we 

proceed as above, but letting ¢1, £2,... tend to 0 at the same time as R, X1,X2, Y > 0. 
There is a problem here similar to that in case (II): all we calculate is a Cauchy 


principal value 
b xo-€ b 
pf Sf (x)dx = lim. (/ foodr+ | fos) 
a &-> a xote 


for a pole at xo. This again may exist even if 


b x0-€ b 
/ f)jdx = lim, i " f(x)dx + lim, Sf (x)dx 
a Ej > UV Sq &9—> 


Xo+eé 
Y dx 
a 
L hy 
P/ —=0 
-_1 x 


Consequently there is a convergence problem, once the Cauchy principal value has 


does not. Thus 


does not exist, but 


been obtained. Apart from this consideration, the hypotheses and conclusions of Propo- 
sitions 12.6 for case (II) and 12.9 for case (II) remain valid even when poles occur 
on the real axis, provided that © is summed only over the non-real poles in the upper 
half-plane. 


Figure 12.6 Types of contour used when there are poles on the real axis. 
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Figure 12.7 Contour used when there are poles on the real axis. 
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Rather than stating a cumbersome general theorem, we content ourselves with a 


typical example: 


There is a real pole at 0, so we take a contour as in Figure 12.7. 


As before, the integrals along 2,13, and I'4 tend to zero. Since there are no poles 


inside the contour, 
—& ,ix XQ alx iz 
F e€ e € 
lim i: dx +4 / dx 4 : dz| =0 
X1,X2—> 00 -x, * € Xx nk 


ez 1 0° jizr-l 


1 
ate alto 
ve va nN Zz 


n=1 


But 


where @¢ is differentiable. Hence | #(z)| < M in a neighbourhood of 0, so 


ek ; 1 
lim —dz = lim —+ (2) }dz 
Ce \E 


ée>0J/c, Z e>0 
ae 1 . 
= lim (- / —yiee") + lim / o(z)dz 
2>0 0 eet e>0 JC, 
=it 
using the Estimation Lemma. Therefore 
OO glx 
P i —dx =i 
-oo X 
Equating real and imaginary parts, 


CO 
P| a a 


co xX 


© sinx 
P —dx = 7 


=p: & 
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The first integral exists only as a principal value, since cos(x)/x behaves like 1 /x for 
small |x|. But for the second we have 


© sinx . ~® sinx © sinx 
P —dx = lim dx 4 dx 
aéy # e>0\J_o Xx a x 


. © sinx 
= 2 lim ——dx 
e>0Je x 


This limit exists, hence by definition it equals 


ee. 
a | sin 4 
0 XxX 


because sin(x)/x — 1 as x — 0. Hence we can remove the P from the second expression 
above. Further, we have also proved that 


[ sin 1 
0 Xx = 2 


V: f. at 


For integrals of this type we integrate around a contour like that in Figure 12.8. 


In this case, on 13, if we substitute z = —t + 2771, then —X2 < t < Xj, so that 
et X eut2mia 
——dz= / ——,——(—1)dt 
r3 P(e*) -»% oe") 
. —X2 eat 
= e2Tia / - dt 
x, ve") 


Putting t = —x, this equals 


ax 


F X2 e 
Saal | dx 
_x, (e*) 
If @ is such that 


e% 
——dz—>0 as X,X. > @w (r=2,4) 
I. p(e*) 


then we obtain 


2mai a 
(l-e ——dx = 27d 
—co P(e*) 


Figure 12.8 Contour for integrals of type (V). 
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where © is the sum of the residues of e%/? at poles between the lines im z = 0, 
imz= 27. 
For example, consider 
(oe) ex 
>= dak O<a<l 
[ =e | 
The relevant singularities are at im /2,3ia/2, with corresponding residues 


_ laina/2 _ 1,3ima/2 
ih a 


Hence the value of the integral is 


201 
| — e271a 


1 Qiza/2 1 ,3ixa/2 
( xe xe ) 


Putting k = e'“/? this is easily seen to be equal to 
XT 


2 sin(za/2) 


VI: Short cuts. 
You should always be on the lookout for quick methods, other than those given above. 
We give two examples, each of independent interest. 


Example 12.10. We use contour integration to prove that the integral of the ‘bell curve’ 
or ‘gaussian’, which is of importance in probability theory, is 


ee 2 = 
/ e*dx=/7 
—0o 


This result is usually obtained by squaring the integral, rewriting it as 


CO [o,@) CO [o,@) 
(/ oar) (/ oa) = i / et axdy 
—00 —00 —o00 J—00 


and then changing to polar coordinates. For a long time it was thought that no evaluation 
by contour integration existed, but several such proofs are now known. We give one of 
them, from Remmert [15]. Consider 


-2 
e 
TO Ty ta 
with 
fw 
a=(1 +,/% 
Then a? = iz, so 
f@-fE+a= aa (12.10) 


The poles of f are simple and located at the points — sa +na withn e€ Z. 
Now integrate f over the rectangle with corners —r,s,s + iim(a), —r + iim@) with 
r, s > 0. The only pole of f inside this rectangle is at 5 and its residue is 


208 


12.4 


Residues 


1,2 
—qa =, 


res(f,a/2) = ee = WE 


This follows from Lemma 12.4, which states that 
€ ) g(x) 
res(—,x) = 
h h' (x) 
if h has a simple zero at x and g(x) £0. 
Using the residue theorem and (12.10), it is easy to see that 


CO Ss 
/ ee’ de= lim | ede = 2mires(f,a/2)= Ja 
66 rS>00 J_, 


since the integrals along the vertical sides of the rectangles converge to zero as 


r,S—> ©. 


Example 12.11. An integral of great importance in applied mathematics is 
coer 2 
i Me dx (AER) 
—CO 
The function e~? is differentiable throughout C. If y is the rectangle with vertices at 


R,R,R — id/2, —R — id/2, then 
2 
/ e *“dz=0 
y 


As R > ox, the integrals over the vertical edges of the rectangle tend to zero. The other 


ie 2 oe ity figs 
i e~dx= / e@ FAY ay 
—Co —cCo 


ne 2 aye 43 
=| e* elite /4 dy 
—0o 


oer 2 2 a 2 
if ete-* dx = et a | e* dx 
—0o —0o 


The latter integral is a constant; it has just been evaluated in Example 12.10 and equals 
/7. SO 


two converge. Hence 


Therefore 


OO 2 2 = 
/ eM e—¥ dy = ef /4./7 
—O0O 

(Although we can derive Example 12.10 from this by setting 4 = 0, we need Exam- 
ple 12.10 to get the result.) 


Summation of Series 


Residues can also be used to find explicit expressions for sums of suitable infinite 
series, based on the idea that the functions cotzz and cosec 7z have simple poles at 
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z=ne Z.Iff isa function that is differentiable at all z = n € Z then it is easy to check 
that 


f@ 


res (f(z) cot z, n) =’—— 
1 


(-))"f@ 


us 


res (f(z) cosec 7z, n) = 


This suggests a method for summing certain series, as follows. Let Cy be the square 
whose vertices are 


(N + A411) 


parametrised, as usual, in the anticlockwise direction, Figure 12.9. We claim that cot zz 
and cosec mz are bounded on Cy, where the bound is independent of N. 

To prove this, first note that on the two horizontal sides z = x + iy, where |y| > = 
Now 
elt + ew im 


|cotzz| = | ——————_ 
eltz — e7ltz 


Je™| + le" 


jei*e| — joe 


lA 


e-7Y + e7y 
e—7y — ery 


= coth|zry| 
< coth 2/2 
Also 
cosec 7z| = (JZ|el% — e~i72|)-1 
< (gle7” — |) 
= (sinh [zry|)~! 
< (sinh /2)7! 


Figure 12.9 Square contour used to sum series. 
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On the other two sides of the square, z = +(N + ) + it, so 


| cot zz| = |tanhzt| 
ie eo 2mt 


and 
cosec rz| = | sinzz|~! 
= | cos(izt)|~! 
— (cosh |zrt|)~! 
<1 


Hence there is a constant M such that | cot zz| < M and |coseczz| < M for any zon Cn. 
Suppose now that for large enough |z| we have 


A 
fl < ize 


Then we claim that 
x=0 


where & is the sum of the residues of f(z) cot 7 z. By Cauchy’s Residue Theorem, 


f(z) cot mzdz = 271 2N 
Cn 


where Dy is the sum of the residues of f(z) cot 7z inside Cy. As N — 0, clearly Hy > 
&, so it is sufficient to prove that the integral tends to 0. But 


| | A 
| f@ cotmzdz < {MON + 4) 
Cy N 


for large enough N by the Estimation Lemma. As N — 0, this tends to 0 as claimed. 
Usually, & is an infinite series, and since it is zero we can sum certain related series. 
This is best illustrated by an example, and the obvious one to try is f(z) = 1/z’. 
At an integer n 4 0 the function z~ cot zz has a simple pole with residue 1/(n2z:), 
whereas at the origin it has a triple pole with residue —z/3. Hence 


T = 1 
Oo= y= 
3 2 nen nm 


Hence 


which is a theorem originally proved by Euler, using a different (non-rigorous) method. 
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If we use cosec zz instead of cot 7 za similar result applies, allowing us to sum series 
of the form )“(—1)"f(n). For instance, using f(z) = 1/z? and arguing much as above, 
we can prove that 


CO 
ean L = me 
= nr 12 


Counting Zeros 


A rather different use for residues is to calculate the number of zeros of a function that 
is differentiable inside a simple loop. We begin with: 


THEOREM 12.12. Suppose that f is differentiable, apart from a finite set of poles, in a 
domain D containing a simple loop y and all points inside y . If f has no zeros or poles 
on y then 


al f@ 4 dz=N—P 

ni Jy f(2) 

where N is the number of zeros of f inside y and P is the number of poles of f inside y, 
each counted according to multiplicity. 


Proof. By Cauchy’s Residue Theorem, the integral is the sum of the residues of 
f'@/f@) at its poles inside y. If z is neither a zero nor a pole of f, then f’/f is 
differentiable at zo. We show that: 


(i) If f has a zero of order k at z; then f’/f has a pole with residue k. 
(ii) If f has a pole of order m at zy then f’/f has a pole with residue —m. 


To prove (i) note that in that case 


f@=@-a)o@ 
where $(z1) 4 0 and ¢ is differentiable in a neighbourhood of z;. Therefore 


f@Q=ke—-alk Ot &—-a)*o' 


so 
f@  k .¢¥@ 
f@Q 2-2 = o@ 
This has a simple pole at z; with residue k, because $’/¢ is differentiable at z;. This 
proves (i). 
Similarly in case (ii) 


where w(z2) 4 0 and @ is differentiable in a neighbourhood of z2. Therefore 
mt) W@) 


(z—zymtl * (z—z)™ 


i@= 
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NYO) 
f@ _ —-m , W'@ 
{2 z-2 W@ 


which has a simple pole at z2 with residue —m. 


Now sum over all poles of f’/f. 


COROLLARY 12.13. Let y be a simple loop in a domain D such that all points inside 
y are in D. If f is differentiable in D and has no zeros on y then the number of zeros of 
f inside y is 
1 / 
I@q, 


2mi J, F@ 


From this we deduce another important theorem: 


THEOREM 12.14 (Rouché’s Theorem). Suppose that f and g are differentiable in a 
domain D that contains a simple loop y and all points inside y. If 


If(z) — a(2)| < If(2)| (12.11) 
for all z = y() ¢t € [a, b]) then f and g have the same number of zeros inside y. 
Proof. Let F(z) = g(z)/f(z). By (12.11), 
|l-—F(y@)| <1 (té [ab) (19:10) 
Inside D, 


F hasazero <> ghasazero 
F hasa pole <=> f has a zero 


Thus by Theorem 12.12 it is enough to prove that 


To do so, let P(t) = F(y (0). By (12.12), 
11 -T@| <1 (te [a,d}) 


so I’ lies inside the circle centre 1 radius 1, Figure 12.10. Now 
F’ > Fy (t 
[ @e.— f (YO) 
y F@) a Fy) 
’TC 
a V(t) 
_ ff « 


TZ 
= 27iw(T, 0) 


=0 


from Figure 12.10. 
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Figure 12.10 Path I in proof of Rouché’s Theorem. 


As an example of Rouché’s Theorem in action, we give a second proof of the 
Fundamental Theorem of Algebra, Theorem 10.9. Let 


gy = 2" tae” | + tam 


f@=z" 
Then 
f(z) — g(2)| = layz™ 1 +--+ + al 
and for z 40 


am 


ay 
ye = [24-4 
£ Zz 


1 
i 
The right-hand side tends to 0 as |z| — ov, so there exists R > O such that if |z| > R 
then 


and then 
f(z) — g(z)| < |z”| = f(z) 


Therefore by Rouché’s Theorem f and g have the same number of zeros inside {z € 
C: |z| < R}. Since f has m zeros (counting multiplicities) so does g. But g cannot 
have more than m zeros, so every polynomial of degree m over C has exactly m zeros, 
counting multiplicities. This is the Fundamental Theorem of Algebra. 


Exercises 


1. Find the residue of f at zp in the following cases: 
@ f@ =z sinz (#0), 2=0 
(ii) f@ =e"! @ #0), w=0 
(iti) f(z) = exp(1/z) @ #0), zo = 0 
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(iv) f(2) = 2(2 + a*)3 (z  £ia), zp = ia, —ia, wherea € R 
(v) fQ@=A+24+2)7!  F exp (rmi/3), r= 1,2,4,5), 29 = exp (17/3) 

2. Find the residue of the given function at each of its isolated singular points, includ- 
ing infinity (provided this is also isolated — that is, not the limit of a sequence of 
finite singularities). 

O17 2?) 
Gi) e#/(2@? +5) 
(iii) cot? z 
(iv) (sinz~4)7} 

(v) (zcosz~7)7! 

3. If f has a pole of order 2 at zo, show that the residue of f at zo is h'(zo), where 
h() = @ — 20y F@). 

4. Let y(t) =e", (¢ € [0, 27]). Find, by residues, the value of 


/ dz ( 1) 
> a> 
y 2 —2az+1 


20 dt 
[ a— cost 


What happens if a < —1? If —-1<a< 1? 


Hence calculate 


5. Verify the following: 


ms 


. a t 
@) i l+bcos*t  /b+1 
(ii) [ aaa 
il — = —= > 
I+at 2/2 
logx 4 
city ee 5 dx 
Ge [4 x Sin 7X Zax aon 


tae 7 m3 
W) [5 = 


6. Show that 


(b > —1) 


o (10x) 
~co (x? + 4)? + OF 
7. Show that 


[e,@) 


cos 5x a4 5 5a WU 
dx = —Sa//2 : ote an 0 
o x +at 2a. ia roa 4 ean 


10. 


11. 


12. 


13. 


14. 


15. 
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. Evaluate: 


20 
(1) / cos? t + sin* t dt 
0 
20 
(ii) | sin? t- cost-+ cos*f- sint dt 
0 


20 
(iii) / 2cos* t+ 3cos? tdt 
0 


. Ifn € Z,n > 0, prove: 


20 
(i) exp(cos f) cos(mt — sint) dt = 27/n! 


0 
20 
(ii) i exp(cos f) sin(nt — sin t) dt = 0 
0 


Evaluate, by integrating suitable functions round a semicircle: 


of 
: o l+x%+x4 


. °° cos mx 
(it) i >+—, dx (a,m > 0) 
0 


x? a" 
Prove that 
a ig 
dx = >0 
[ (x? + a?) 7 16a? (a ) 
By integrating round a rectangle whose vertices lie at R,R + 1,—R + 1,—R, and 


letting R + ov, show that 


| © cosh(cx) 


—0oo cosh(zx) dx = sec(c/2) (c € [—2,7]) 


Prove that 


oct T 
7 (+ 1)7! dt= —— ae [0,1)) 
0 sin 7a 


by making the substitution ¢ = e* and integrating e““(e? + 1)~! around the rectangle 
with vertices +R, +R + 271. 


Inversion Formula for the Laplace Transform. Suppose that F is differentiable in C 
except for a finite number of poles, of which z, ...,Z, satisfy rez < a and none lies 
on the line rez = a. If there exist M > 0,b > 0,c > O such that |F(z)| < M/|zI° 
for |z| < b, show that 


a+iR n 
f(t) = lim / e F(z) dz = 2mi 2 res(e F(z), z,) 
Roo Jaq—iR a 
If F(z) = a(z? + a7)! (aw > 0) show that f(#) = sinat. 
Use real analysis to prove Jordan’s inequality: 
i 2 
Bre (re [0,2/2) 


“4 
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Hence show that if Sp(t) = Re” (t € [0,2] then 
imz 


lim | ~—dz=0 (m>0) 


R>JSp Z 
By integrating e”“/z along the contours I'|, C:, 2, Sr defined by: 


l;@=t (t €[-R,-e]) 
C(t) =e" (te [0,2] 
To(t)=t (t €[e,R) 


prove Dirichlet’s Discontinuous Factor: 


S644 0 ifm =0 
sin mx ; 
iy dx= 4 2/2 ifm > 0 
Y —x/2 ifm <0 
16. Show that: 
Co 
z z 22° 
i =1 
® ee —] 2 Deiat 
. Low C1%2z 
(il) cosec z = . | d Dae 
oY 
(iii) cosec? z = X @—niye 
17. Sum the following series, where a ¢ Z: 
Co 
@) \\mtay? 
—0o 
[ee 
Gi) Dee +a) 
0 
[ee 
Git) )°Qn+1)* 
0 
Ce 
(iv) (-1)"Qn+ 7? 
0 
18. By integrating 
zel kz 


(a — 2)sin wz 


round a suitable contour, show that 


= nsinbn nw sinba 
Yi" 55 = 55 (|b| < 7) 
ee a—n 2 sina 


19. 


20. 


21. 


22. 


23. 
24. 


25. 


26. 


27. 
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By considering f(z) = 1/(z — €) + 1/z, show that when & ¢ Z, 


[o,2) 


1, 2é 
a a pare. 


Using the result of Exercise 19, integrate m2 cot zx along a suitable contour to show 
that 


CO 
log sinaz = logmz+ a log( — 2 /n’) 
n=1 
where the log is chosen to make log 1 = 0 in each term. Taking exponentials, obtain 
the infinite product expansion of the sine function (defined as the limit of suitable 
finite partial products, by analogy with infinite sums): 


CO 
sina Z = 1Z [a — 2 /n) 


n=1 
If |a| > e, use Rouché’s Theorem to prove that the equation 


e* = az" 


has n roots with |z| < 1. 

Find the number of zeros of the following polynomials that lie inside the unit circle: 
(i) 2? — 226+ 22 —8z-2 

(ii) 2° —2+322-—z+8 

(ii) 24 —5z+1 


How many zeros of z+ + 423 + 6z* — 4z +3 lie inside the disc |z— 1| < 1? 


Prove that however small ¢ > 0 is chosen, for all large enough n the function 
ise ee Ok es i 

has all its zeros inside the disc |z| < «. 

Let p(z) be a polynomial of degree n, and suppose that p(z;) = p(z2) = O with 

z1 4 Z2. Show that there exists a zero of p’(z) within the circle centre F(z +22) and 

radius ral — Z2| cot(z/n). 

Residues in Reverse. Show that the residue of tan?—! wz at z = 5 is (— 1P?n-t, 

for integer p > 0. (Hint: integrate it round the rectangle with genic at —1R,1 — 

iR, 1 +1R,iR, then let R — oo and estimate sizes.) 


Show that 


: °° logx 
dx = 0 
of 2S 


(ii) log x a 
i +22 "| 
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13.1.1 


Conformal Transformations 


In mathematics we often encounter functions that preserve some structure that is of 
interest. For example, in Euclidean geometry rigid motions preserve lengths, angles, 
and areas, while changes of scale preserve the shape (but not the size) of geometric 
figures. Homomorphisms of groups preserve group multiplication. Arithmetic modulo 
n preserves addition and multiplication. Topological transformations preserve connect- 
edness. Conversely, given an interesting class of functions, we can ask what structure 
they preserve. This chapter deals with a property preserved by all differentiable (equiva- 
lently analytic) complex functions, namely: angles between curves. Functions with this 
property are called ‘conformal’. 

The conformal property can be used in two directions. By studying differentiable 
functions, we can prove theorems about curves; by studying curves, we can prove theo- 
rems about differentiable functions. The second technique is of great importance in the 
advanced ‘geometric’ theory of differentiable functions, but only the first falls within 
our present scope. The method has interesting applications to potential theory and fluid 
dynamics, and we outline the beginning of these. We also consider in moderate detail 
several special conformal functions; in particular ‘Mobius maps’, often called ‘MGbius 
transformations’, which have the remarkable property of mapping circles to circles. 


Measurement of Angles 


Since we are studying preservation of angles, we must discuss these first, in a manner 
that is appropriate for this chapter. The use of a real number to measure an angle is not 
always entirely satisfactory, because the same angle corresponds to many different real 
numbers. However, if real numbers 6 and @ represent the same angle, then 6 — ¢ is 
an integer multiple of 277, and conversely, so the ambiguity is not too great. For many 
purposes it can be avoided by making some artificial convention, such as the requirement 
that —z < arg(z) < z. For other purposes it is more convenient to measure angles in a 
natural and unambiguous way, though one that is less familiar than the real numbers. 


Real Numbers Modulo 27 
If x, y € R we say that x and y are congruent modulo 27, and write 


x=y (mod 27) 
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if there is an integer n such that x — y = 2n7. Congruence modulo 27 is an equivalence 
relation, so it partitions R into mutually disjoint equivalence classes. We denote the set 
of all such equivalence classes by 


R/2x 


For each x € R let p(x) be the equivalence class (or congruence class) to which x 
belongs. This defines a function 


p:R—- R/2z2 
and 
PQ) = {x + 2nt :n € Z} 


Given an angle measured by a real number 9, the same angle is measured by all 
6 + 2nz (n € Z), and by these real numbers only. Instead of picking one of them, we 
can represent the angle by the whole collection, namely p(@). In other words, the natural 
measure of an angle is not a real number, but an element of R/27. 

For instance, the real numbers 


...,—l1m/3, —52/3, 2/3, 77/3, 137/3,... 
all represent the same angle, and the set 
{...,-llw/3, —52/3, 1/3, 70/3, 1377/3, ...} 


is the unique element of R/2z corresponding to this angle. 


Geometry of R/2z7 
Geometrically, R/2z is a circle. To see this, define 
q:R-C g(x) = e* (x € R) 
The image of q is the unit circle in C, which we denote by 
S={zeC:z=e" (xe R} = {ze C: |z =1} 


Since e?7! = 1, Proposition 5.3 shows that g(x) = q(y) if and only if x = y (mod 27), 
so 


qx=qy) = p&)=pv) (13.1) 
Therefore we can define 
J: R/2x -S 
as follows: if r ¢ R/27 then r = p(x) for some x € R, and we set 
I(r) = a@) 


By (13.1) this is independent of the choice of x € R, and j is a bijection. Thus the 
elements of R/2z are in natural one-to-one correspondence with the points of a circle. 
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This bijection preserves continuity in a natural manner. If X C C we say that a func- 
tion f : X > R/2z is continuous if and only if the composite function jof : X > S 
is continuous in the usual sense, considering S as a subset of C. This accords with the 
intuitive idea of geometric continuity if we think of R/2z as a circle. (In topological 
terms, we use j to transfer the usual topology of S to R/2z, defining U C R/2z to be 


open if and only if j(U) is open in S. Continuity of maps f : X — R/2z in this topology 
is equivalent to what we have just defined.) 


Operations on Angles 


We can define addition and subtraction of elements of R/27, corresponding to geo- 
metric addition and subtraction of angles. Let r,s € R/2z. Pick x,y € R such that 
P(x) =r, p(y) = s, and define 


r+s= p+ y) 
r—s=p(x— y) 


As usual we can verify that these definitions do not depend on the choice of x and y, 
using (12.1). 

We can restate the above in group-theoretic terms. The set G = {2mm : n € Z} is 
a subgroup of the additive group of real numbers, hence a normal subgroup since the 
latter is abelian. The quotient group R/G is what we have called R/27. As an analogy, 
consider the definition of the integers Z, modulo n as the quotient group Z/H where 
H = {kn: ke Z}. 

In Z, we can also define multiplication. You should verify that this does not work for 
R/2z. It fails because 27 is not an integer. However, we can sensibly multiply an angle 
in R/2z by an integer, since this reduces to repeated additions or subtractions. 


The Argument Modulo 27 
For z € C \ {0} we can now define a ‘mod 27’ version of arg z, namely 
arc z = p(arg z) € R/2z 


The notation is deliberately chosen to resemble ‘arg’, and also reminds us that angles 
are involved. The advantages of ‘arc’ over ‘arg’ are twofold. First, there is no ambiguity. 
Second, and more important, the function 


arc: C \ {0} > R/27 


is continuous. This is false for arg, for as z moves across the negative real axis, arg z 
jumps from near z to near —z, because of the convention that —z < argz < a. In 
contrast, since p(—z) = p(zr), this defect is not shared by arc. In fact, if z = re” r > 0, 
then j(arc(z)) = e!. Using this, or directly, it is not hard to verify that 


arc(z1z2) = are z] + are 72 (13.2) 


for all z1, z2 € C\ {0}. Again if we use arg this is true only up to integer multiples of 277. 
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Thus we have two distinct ways to represent an angle: as a uniquely defined ele- 
ment of R/2z (an equivalence class of reals modulo 277), or as a real number chosen 
from this class, unique only up to integer multiples of 27. Both are useful, and there 
are advantages in passing between them at will. We do this in the proof of our next 
result. 


LEMMA 13.1. If y : [a,b] > C is a path and y'(to) exists and is non-zero for some 
to € [a, b], then y has a tangent at z) = y (to) making an angle arc y'(to) with the real 
axis. 


Proof. Let y(t) = x(t)+iy(t). Then the required angle 0 belongs to the congruence class 
modulo 27 of 


tan™!(y'(to)/x'(to)) = arg(x (to) + iy'(to)) = arg y'(to) 


Taking congruence classes, we get 


0 = arc y'(to) 


In future we will be less explicit about passing between R and R/2z. 


DEFINITION 13.2. If v1 and y are two paths meeting at z = yi(t1) = y2(t2) having 
derivatives y;(t1), Y;(t2) # 0, then the angle between y; and y2 at zo is 


6 = are y{(t1) — arc y3(h) € R/2n 


as in Figure 13.1. 


Figure 13.1 Angle between two intersecting curves. 


Conformal Transformations 


In this section we consider functions f : D — C, where D is a domain. It is conve- 
nient to distinguish the two copies of C by using (x, y) as coordinates in D and (u,v) as 
coordinates in the image C. As usual we let z = x + iy, and we set w = u + iv. (The 
classical notation for complex functions was w = f(z) before set theory came along.) If 
f is differentiable on D we have 


f(x + iy) = u(x, y) + iv@, y) 
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where u and v are real-valued functions of two real variables x, y. Hence f defines a 
function from the subset D of the (x, y)-plane to the (u, v)-plane. A smooth path y in D, 
with 


yQH=xO+yO (te [a,b)) 
is mapped by f to a path 
fy) =f(vO) = ua), yO) +A, yO) (¢ € [a,b]) 


in the (u, v)-plane. 


REMARK 13.3. Until now we have written composition as f o y, but from now on we 
omit the circle in formulas when there is no confusion with the product. This avoids 


undue proliferation of circles. 


Suppose that z) = y(fp) and y’(f9) # 0 for some fo € [a,b]. By the chain rule, 
(fv) (to) =f’ (v (to))v' (to) = f' Zo) (to) 


Therefore 


are ((fy) (to) = are (f'(zo)y'(to)) 
= arcf’(zo) + arc y'(to) (13.3) 
by (13.2). 
Suppose that and y2 are two paths through zo, say zo = yi(t1) = y2(f2). Then (13.3) 


implies that fy, and fz meet at the same angle as y, and y,. Geometrically this is clear 
since both tangents are turned through the same angle arc f’(zo). Alternatively, compute: 


are ((fy1)'(t1)) — are ((fy2)'(t2)) = are f’(zo) + are yj(t1) — are f(z) — are y3(t2) 
= are yj; (t)) — are y3 (ty) 


DEFINITION 13.4. A function f : D — C that preserves angles between paths at a 
point zo is conformal at Zo. 
If f is conformal at all zo € D we say it is conformal. 


The terms conformal function, conformal map, conformal mapping, and conformal 
transformation all mean the same thing. The fourth is traditional, the third is going out 
of fashion, the second is convenient, and the first agrees with much current terminology. 
One advantage of the fourth is that it focuses attention on how paths and other geometric 
figures transform under f. 

Observe that we have proved: 


THEOREM 13.5. Let f : D > C be differentiable. The f is conformal at all z € D 
such that f'(zo) # 0. 


If f(a) = 0 then f may not be conformal at zo. For example, if f(z) = z* then the 
positive half of the real axis and the ‘positive’ half of the imaginary axis ({iy : y > 
O}) meet at right angles. They transform into the positive half of the real axis and the 
negative half of the real axis, with an angle of z. 


13.2 Conformal Transformations 
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In fact, if z) is a zero of f’ of order m then the angle between paths meeting at z is 


multiplied by m+ 1 on transforming by f. 


We can find out a little about how f affects lengths. If zo, z € C and f is differentiable 
at zo, then the ratio of the distances between f(z) and f(zo) and between z and Zp is 


f(z) —f@o)| _ | F@) —F@o) 


lz — zol Z—Z0 


which tends to f’(z) as z —> Zo. So near zg distances are approximately multiplied by 


Lf'(zo)I. 


Some special cases illustrate the previous analysis. 


Example 13.6. f(z) = z°. 
Here 
u(x, y) +iv(x,y) = @+iy)? 
= (x? — 3xy”) + i(Gx"y — y’) 


so that 


u(x, y) = eS 3xy? 
v(a,y) = 3x°y — y 


Consider the paths 


vit) =1+it 
y2o(t) =t+i 


These are respectively the lines x = 1,y = 1, which meet at right angles. The paths 


ly =fy and [2 = fy2 are given by 


ri(@) =( — 37) +iGr—2) 
Ta) = ( — 30) +132 —1) 


These curves are sketched in Figure 13.2. As expected, I) and I'2 meet at right angles 


at (—2, 2). 


Example 13.7. f(z) = 1/z. 
This time 


Wey) = zo 


vony) = a 


If c > 0, the circle 


y(t) = ce” (t € [0,27]) 


(13.4) 
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Figure 13.3 Images of circles centred at the origin for Example 13.7. 


transforms into 
Te) =c7te™ (t¢ € [0,27]) (13.5) 


Hence the system of concentric circles (13.4), as c varies, maps into the system of con- 
centric circles (13.5), Figure 13.3. Further, the lines y = kx (k € R) through the origin 
are given by 


Og (t) = t + kit 
and transform to 


1 1 ik 1 


An) = (0+ kit = T4357 eae 


which also represents lines through the origin. Now y, and 4, all meet at right angles, 
and so do their transforms I’. and Ax, Figure 13.4. 


Example 13.8. f(z) = sinz. 
Here 


u(x, y) = sin xcoshy 


v(x, y) = cos x sinhy 
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Figure 13.4 Paths for Example 13.7 and their transforms all meet at right angles. 


Figure 13.5 Example 13.8: Cartesian coordinate grid transforms into a system of confocal 
hyperbolas and ellipses. Again all curves meet at right angles. 


The lines x = c (c € R) transform into confocal hyperbolas 


=1 


sin?c cos? 


and the lines y = d (d € R) transform into confocal ellipses 


a Vv 


= 1 
cosh*d sinh? d 


as in Figure 13.5. The two systems of straight lines form a grid in Cartesian coordinates, 
crossing at right angles. The hyperbolas and ellipses also meet at right angles, except at 
points where f’(z) = 0, namely f(z) = +1. These are the common foci of the ellipses 
and hyperbolas. 
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Critical Points 


Theorem 13.5 proves that a differentiable complex function is conformal at any point zo 
such that f’(zo) 4 0. We now consider what happens when f’(zo) = 0. 


DEFINITION 13.9. Let f be a differentiable complex function. A point zo is a regular 


point if f’(zo) # 0. It is a critical point or singular point if f'(zo) = 0. 


Even though a complex function f need not be conformal near a critical point, the 
local geometry still has considerable structure. To unravel what happens at a critical 
point, consider the local Taylor expansion: 


Fo + h) = fo) + hf" Go) ++ + WFO Go)/nl + 


This consists of two parts. The constant part is wo = f(zo), which calculates where the 
point zo in the z-plane is mapped in the w-plane. The variable part 


Feo + h) — flo) = hf’ o) + + + HF Eo)/nl + 


specifies how the function behaves near wy) = f(z). In particular, the behaviour is 
controlled by the first non-zero derivative f(z). This motivates: 


DEFINITION 13.10. A differentiable function f : D — C is of order n at zo € D, 
(where n > 1) if f(zo) £0, and f(z) = 0 forl < k <n. 


A function of order 1 satisfies f’(zo) 4 0, and in general, the Taylor series at a fixed 
point zo for a function of order n has the form 


f(zo +A) =f (zo) + A"f (zo) /n! +++ (13.6) 


Examples 13.11. 
(i) f(z) = 2" for integer n > 1 is of order n. 
Gi) sing =z—2°/3!+--- is of order 1. 
(iii) cosz—1=z*/2!—24/4!+.--- is of order 2. 


Equation (13.6) indicates that the behaviour of a function near a critical point is given 
by the order of the corresponding zero of the derivative. Indeed, a function of order 
n > 1 can be written as: 


feo +h) =f (zo) + kh” + higher order terms in h, where k = f(zo)/n! A 0 
This gives an approximation of practical value in applications: 
f(2o +h) —f(zo0) © kh” for suitably small h (13.7) 


The size of h required to give a decent approximation depends on the context. As z 
moves from zy —h to z) +h, the difference f(z) — f(z) in (13.7) changes approximately 
from k(—h)" to kh". The behaviour depends on whether n is even or odd. For n odd, 
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it moves from —kh" to kh", passing through zero, and for n even it travels from kh” to 
zero, then back again to kh”. 

A more precise sense of the behaviour can be obtained by writing A in polar 
coordinates, h = re. For suitably small h, (13.7) gives 


f(zo + re?) — f(zo) = kre +--- (k= Fo) /n) 
so 
Lf(zo + rel?) — f (zo)| © kr” 
and 
arc (f(zo + re?) — f(zo)) ¥n@+a_ where a = arc (f™(zo)) 


For small values of h, f transforms a tiny portion of the z-plane near zp to the w-plane 
near wo = f(z), scaling r to kr” and rotating the original angle 0 to n6 + a, Figure 13.6. 

Rotating the line from zo to zo + / in the z-plane through a further angle ¢ rotates the 
corresponding line in the w-plane to n(@ + @) + a, so the angle between the trans- 
formed lines in the w-plane is nd. The sector of the circle radius r, angle @ in the 
z-plane is scaled to a sector in the w-plane, radius kr”, with angle expanded to nd, 
Figure 13.7. 

For n = | the point is regular, and f is conformal. Forn > 1, the function f transforms 
Zo to Wo = f (Zo), and a point distance r from Zo is scaled approximately to kr” from wo 
in the w-plane, while the angle between two paths through z) changes from @¢ in the 
z-plane to n@ in the w-plane. 


Azo+h) 
eas rotate to 
zoth f to kr” nO +a 
VL 
E for suitably Wo 
0 small h 
w-plane 


Z0 
z-plane 


Figure 13.6 The tangent map translating, rotating and scaling from zg to wo. 


Figure 13.7 Transforming a sector of a circle centre zp, radius r. 
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Mobius Maps 


For fixed a, b, c,d € C, the function 


d 


is called a Mobius map or bilinear map. (Again, ‘map’ can be replaced by ‘function’, 
‘mapping’, or ‘transformation’ .) These maps have a number of important properties, and 
we discuss some of them. 
First, note that f is differentiable for z 4 —d/c, and 
ad — bc 


es rae a 


since ad — bc # 0, again provided z 4 —d/c. Hence f is conformal throughout the 
domain C \ {—d/c} on which it is defined. 
Now suppose that 


f@= - _ E. Gs= be 20) 


Az+B 
f= C+D (AD — BC £0) 


is a second Mobius map. Then the composite 
(Aa + Bc)z + (Ab + Bd) 
(Ca + De)z + (Cb + Dad) 


ef (2) = 


is another Mobius map, because 
(Aa + Bc)(Cb + Dd) — (Ab + Bd)(Ca + Dc) = (AD — BC)(ad — bc) #0 


So composing two MGbius maps always gives a Mobius map. 


Mobius Maps Preserve Circles 


A remarkable, and useful, property of such maps is that they transform circles (or 
straight lines) into circles (or straight lines). To save breath, let us agree that in this 
section a ‘circle’ may be either a circle or a straight line. (We can think of a straight 
line as a circle of infinite radius, remembering that oo is a fairly respectable concept in 
complex analysis.) 

Suppose p, q € C with p #q, and letk > 0. The equation 


k-Pl_, 
Iz — ql 
is satisfied by those points whose distances from p and q are in the ratio k. It is well 
known in Euclidean geometry, and can easily be checked using coordinates, that if k # 1 
such points lie on a circle, and if k = 1 they lie on a straight line. 
Here is a quick proof. By scaling and a rigid motion, we can choose coordinates in 


(13.8) 


the plane so that p = (0,0) and g = (1,0). If (x, y) is distance kd from p, and distance d 
from q, then 


(x +y = ky) 1) + y4 
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Hence 


P+y=R0' +y —2x41) 


k2 Oe ae k2 k2 = 
oS eit f eat Veet) 


which is the equation of a circle. 


which implies 


If we put 
+b 
w=f@) ==" (ad - be £0) 
cz +d 
it is easy to verify that 
—dw+b 
 cw-a 
Now (13.8) takes the form 
md 2 | = 
a 
cetd — 4 
which simplifies to 
—P 
Wo ae (13.9) 
Iw —Q| 


where 


P = (b+ pa)/(d+ pc) 
Q = (b+ qa)/(d+4qc) 
K =k\d+qe|/|d + pe| 


Hence (13.9) also represents a circle. This proves that the Mobius map f transforms 
circles into circles. 


Classification of Mobius Maps 


The above calculation, while it verifies the assertion, is not as instructive as we might 
hope, and the following approach has its advantages. We consider several special types 
of Mébius map that have especially simple forms, which can easily be seen to trans- 
form circles into circles. Then we show that a general Mobius map can be obtained by 
composing these types. 

It is perhaps not so surprising that this approach works, in view of the composition 
property of Mobius maps. The decomposition can be seen as analogous to the well- 
known fact that any rigid motion of the Euclidean plane can be obtained by composing 
a translation, a rotation, and a reflection. 

We begin with the special types. 
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Translation: w = z+ k(k € C). This corresponds geometrically to moving points 
rek to the right and imk upwards, and clearly preserves the shape of geometric 
figures, in particular circles. 

Rotation: w = ez (6 € R). All points rotate round the origin through angle 0. Again 
this preserves the shape of geometric figures. 

Magnification: w = hz(h > 0). This produces a change of scale. If h < 1 it shrinks 
rather than magnifies, but such pedantry is irrelevant and we still use the term 
‘magnification’. It therefore maps geometric figures to similar geometric figures. 

Inversion: w = 1/z. Devotees of ‘inversive geometry’, now largely out of fashion, 
will recognise the corresponding ‘geometric inversion’, which is well known to 
preserve circles. However, circles can map to straight lines, or straight lines to 
circles. The rest of us can check this by a coordinate calculation similar to the one 
above. 


We can now state: 


THEOREM 13.12. Every Mobius map can be obtained by composing a translation, an 
inversion, a magnification, a rotation, and another translation. 


Proof. Suppose that a,b,c,d € Cand ad — bc # 0, where c # 0. Define 


t\(z) =z+d/c (translation) 


j(z) = 1/z (inversion) 


ad — ae 
mz) = | 5 z (magnification) 
c 
be— 
r(Z) = en a z (rotation) 


tn(z) =z+a/c (translation) 


It is routine to verify that 


az+ 
(tarmjty)(z) = ——— 
czt+ 


If c = 0 define 


t(z)=z+b/a_ (translation 


mz) = Is|z (magnification) 


r(z)= 7 i z (rotation) 
d|\a 
noting that ad — bc € 0 implies ad # 0 since c = 0, hence a 4 0,d 4 0. Now 
az+b 
(rmt) )(Z) = —— 
cz+d 


COROLLARY 13.13. Every Mobius map transforms circles into circles. 
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There are other functions f : C — C that preserve circularity, the most obvi- 
ous being complex conjugation (which is not differentiable). A theorem of Constantin 
Carathéodory asserts that every such function is either a Mébius map, or a Mobius 
map composed with conjugation. No differentiability assumptions are needed for this 
theorem. We do not prove it here. 


Extension of Mobius Maps to the Riemann Sphere 


There is a natural way to extend Mobius maps to the Riemann sphere by examining their 
behaviour as z > oo. 


PROPOSITION 13.14. Any Mobius map can be extended uniquely to the Riemann 
sphere so that it is differentiable at every point. 


Proof. By continuity we can find this extension by letting z — oo, but we must then 
verify differentiability. If 


az+ 
{@= (ad — bc # 0) 
cz+d 

then letting z > oo we must define 

SRD 

— ifc 

feo)=) ¢ 
coo ifc=0 


Differentiability at oo is an easy exercise. 


When extended in this manner, f is a bijection from C U {oo} to itself. If we identify 
CU {oo} with the Riemann sphere S’, then circles in C remain circles on S?, and straight 
lines in C also become circles on S?. 

It is instructive to work out the effect of the basic types of Mébius map on S?. We 
omit the details. 


Potential Theory 


We now turn — briefly — to classical applications of complex analysis. These were instru- 
mental in convincing mathematicians that the subject was worth taking seriously, before 
the basic concepts had been made rigorous. We discuss potential theory, a general area 
of mathematical physics with applications to gravitation electrostatics, magnetism, and 
fluid flow. 


Laplace’s Equation 


The two-dimensional Laplace equation 


a ao _ 


—++=5 =0 
ax? ay’ 
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for a function (x, y) is important in potential theory, with applications in particular 
to fluid dynamics. It is closely connected with complex function theory, as we now 
demonstrate. Let f : D — C be differentiable, with z = x + iy, and write 


f(2) = u(x, y) + v(x, y) 


as usual. Then 


FO= ou | ,av _ dv , ou 
Ox ox dy dy 


as in Section 4.2. By Theorem 10.3 f” exists throughout D. If we let f’(z) = U+iV then 
the Cauchy—Riemann Equations of Theorem 4.12 state that 


dU _ aVv dV aU 
Ox dy ox dy 


Therefore 


SO We get 


eu d (du\ OU dV (du) | 07h 
ax2 ax \ ox ax sy dy \ dy dy 
Therefore u(x, y) satisfies the Laplace equation. Similarly, so does v(x, y). 
For instance, consider f(z) = ze*. Then 


u(x, y) = xe* cos y — ye* sin y 
v(x, y) = ye’ cos y + xe” sin y 


and it may be verified directly that these functions satisfy Laplace’s Equation. 
Solutions of Laplace’s equation are called harmonic or potential functions. Pairs of 
functions u,v obtained from a differentiable function by the above method are called 
harmonic conjugates. 
The lines u = constant, v = constant, are orthogonal (mutually perpendicular) in the 
(u, v)-plane, so by conformality the curves 


u(x, y) = constant 


v(x, y) = constant 


are orthogonal in the (x, y)-plane. In potential theory, if wis harmonic, the lines u(x, y) = 
constant are called equipotential lines, and the set of orthogonal curves v(x, y) = con- 
stant are called streamlines. In the case when Laplace’s equation describes fluid flow, 
streamlines are the paths along which the fluid flows. If we are given u in a domain D 
and wish to find the streamlines given by v, we can often use complex integration. For a 
fixed point zo € D, and any z; € D, we have 


1 (du .du 
ftay= | fede = | (= -i ae 
z0 z \Ox dy 
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For example, if u(x,y) = x” — y*, which is harmonic, then picking zy = 0 for 
convenience we have 


Z1 2 
2zdz = Zz} 


Z1 
flay= [ Qr+ 2nd = [ 
0 0 


Hence f(x + iy) = (x + iy)* = (x* — y*) + i(2xy). So the streamlines have equations 
2xy = constant, or equivalently 


xy = constant 


Often, as in this case, we can guess what f, hence v(x, y), ought to be — a process dignified 
by the term ‘inspection’. 


Design of Aerofoils 


Conformal maps and complex function theory played a part in the design of aircraft 
in the early days of aviation (and, in a more sophisticated way, still do, although most 
modelling is now done using the numerical methods of computational fluid dynamics). 
In particular, the transformation from the z-plane to the w-plane given by 


w—2 _ z—1 ‘ 
wt2 \z+l1 


maps a circle in the z-plane, passing through —1 and containing +1 in its interior, into 


a ‘bent teardrop’ shape as in Figure 13.8 that resembles a cross-section of an aircraft’s 
wing. This is known as a Joukowski aerofoil, defined by a Joukowski transformation, 
after Nikolai Zhukovsky, who discovered the transformation. 


Figure 13.8 Left: Flow past circle (with appropriate circulation added). Right: Circle transformed 
to Joukowski aerofoil, with transformed flow. 


It is used as follows. It is quite easy to solve the Laplace equation and find streamlines 
for the flow of a fluid round a circular disc. Now apply the Joukowski transformation: 
the disc maps to an aerofoil, and the streamlines round the disc map to the streamlines 
round the aerofoil. In order to make the flow behave in a sensible physical manner at the 
sharp edge of the aerofoil, the flow past the circle is modified by adding a ‘circulation’ 
term that flows round the circle. From this we can calculate properties of the flow; in 
particular, the amount of ‘lift’ imparted to the aircraft. More subtle transformations, such 
as the Karmann—Trefftz transform 
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w—n z-1\" 
w+n \c+l1 
give more accurate information. 


Exercises 


1. Sketch the image of the set D = {z € C: rez > 0,imz > O,|z| < 1} under 
the conformal map f(z) = 1/z. Draw the images of the lines in D parallel to the 
coordinate axes. 


2. Show that the map 


1+k 1 
fo=¥ ~ exp ( i(5 tan“! k) 2) 
transforms the strip between the lines y = kx, y = kx + b into that between x = 0 


andx = 1. 


3. Find a conformal map f of the annulus 2 < |z| < 5 on to the annulus 4 < |z| < 10, 
such that f(—5) = 10. (These points lie on the boundary so f must also be defined 
there.) Find another with f(5) = 4. 


4. With a suitable choice of the square root, show that 
f@ = Jz —p —i/p 


maps the exterior of the parabola y7 = 4px (p > 0) onto the right-hand half-plane 
x > 0, Figure 13.9. 


5. Find a conformal map that sends the interior of the right-hand branch of the 
hyperbola 


PH-y~ar 


to the upper half-plane y > 0, Figure 13.10. (Hint: what is rez*?) 


6. Show that the semicircle |z| < 1,rez > 0 is mapped by 


fQ=L24+z 


Figure 13.9 Conformal map of interior of parabola to half-plane. 
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Figure 13.10 Conformal map of interior of hyperbola to half-plane. 


Figure 13.11 Conformal map for Exercise 6. 


Figure 13.12 Region for Exercise 7. 
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to the region bounded by the parabola x = —y” and the curve whose equation in 


polar coordinates is 
r = 2cos(@/3) (|O| < 37/4) 


See Figure 13.11. 


7. Show that for suitably defined square roots, when a,b,c > 0, 


V24+ 24+ Ja + 7 


ON ee =e 


maps the plane with the segments between —a, and b, and —ic and ic removed, to 


the upper half-plane. See Figure 13.12. 
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-l+i i 1 


+i a 
-l1 1 = 
1 
-l-i =i 1-i 


Figure 13.13 Left: Region for Exercise 8. Right: Exterior of unit circle. 


Figure 13.14 Region for Exercise 9. 


8. Show that for suitably defined square roots, 
1 _—— 
f= Fae (Wao ver 5) 
2+V5 


maps Figure 13.13 (left) to the exterior of the unit circle, Figure 13.13 (right). 


9. Find a conformal map from Figure 13.14 to the upper half-plane. (Hint: compare 
Exercise 7.) 


10. Show that 


cos7z— coshzh 
fO= / 


1+cosxz 
maps Figure 13.15 to the upper half-plane. 
11. Find a conformal map from Figure 13.16 to the upper half-plane. 


12. Find the Mobius map that sends —1, oo, i respectively to: 
(i) i,1,1+i 
(ii) 00, i, 1 
(iii) 0,00, 1 
13. Find the general form of a Mébius map that: 
(i) Transforms the upper half-plane to itself. 
(ii) Transforms the upper half-plane to the lower half-plane. 
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Figure 13.15 Region for Exercise 10. 


Figure 13.16 Region for Exercise 11. 


(iii) Transforms the upper half-plane to the right half-plane. 

(iv) Preserves the unit circle. 

(v) Preserves the coordinate axes. 

(vi) Transforms the upper half-plane to the interior of the unit circle. 


14. Invariance of Cross Ratio. The cross ratio of four complex numbers z1, 22,23, 24 1S 


(Z1 — Z2)(23 — 24) 
(z1 — 24)(Z3 — Z2) 


Let w(z) = — be a Mobius map with ad — be 0, and let w; = yu(z;) for 


cz 
j= 1,2,3,4. Prove that 


(wi — W2)(w3 — wa) _ (Z1 — Z2)(Z3 — 24) 
(wi — wa)(w3 —w2)  (z1 — 24)(z3 — 22) 


(Hint: first work out how w; — w2 relates to z1 — 22.) 


15. Let f be a Mobius map. A fixed point of f is a point z such that f(z) = z. 
Prove that f has at most two distinct fixed points, including oo. 
If f has a unique fixed point (including oo) it is said to be parabolic. Prove it can 
then be written in the form 


i _ 4 
f@-2 2-20 


th (% #00) 
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16. 


17. 


18. 


19. 


20. 


or 


f(j=z+h (atranslation) 


If f has two distinct fixed points z;, z2, show that it can be written in the form 


C8 Noa © OU Gore 


f(@) — 22 eo) 


(21,22 # ©) 


or 
f@)— 2 =kE—z2) = 00) 
Such a map is said to be hyperbolic if k > 0, elliptic if k = &* (a 0), and 
loxodromic if k = ae”, where a # 1 is real and a # 0. 
With the definitions of Exercise 14, prove the following: 
(i) Any MOébius map (az + b)/(cz + d) is equal to one for which ad — bc = 1. 
(ii) Having ensured this, if a + d is real then the map is elliptic if |a + d| < 2, 
hyperbolic if |a+ d| > 2, and loxodromic if |a + d| = 2. 
(i) If a+ dis not real, the transformation is loxodromic. 
Show that 
u(x,y) = — 3xy" 


is harmonic. Find a harmonic function v : R? — R such that 


f@ +1y) = uG@, y) + iv, y) 
is a differentiable complex function. Prove that v is unique up to the addition of a 
real constant. 


For f(z) = 1/z, write f(z) = u(x, y) + iv(@x, y). Sketch the level curves u(x, y) = 
constant, v(x, y) = constant. If a level curve of u meets a level curve of v, what is 
the angle between them? Is there an easy way to see this, and if so, what is it? 


Find the most general cubic polynomial 
u(x, y) = ax? + bx*y+cxy?+dy> (a,b,c,d € R) 


that is harmonic. Find a differentiable function with u as its real part. 


Verify that the Joukowski transformation does, as claims above, give rise to an 
aerofoil shape. Look up pages 131-134 of Kyrala [12] and see how to compute 
streamlines round it. A more modern book is Anderson [1]. The NASA applet at 
www.egrc.nasa.gov/W W W/K-12/airplane/map.html is also worth investigating. 
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When Weierstrass started his programme to rigorise complex analysis he based it on 
power series. Because these have well-behaved convergence properties, and can be 
differentiated or integrated term by term, they provide a tool of great technical value. 
However, this tool has limitations, which we illustrate in the final section of this chap- 
ter: many important functions cannot be represented by a single power series — mainly 
because power series converge on discs. 

This limitation can be overcome by the method of ‘analytic continuation’, which lets 
us extend the domain of a complex function, subject to suitable conditions. It turns out 
that such an extension is not always unique, and the problem of describing the different 
possibilities and how they are related leads to a remarkable geometric concept, known 
as a ‘Riemann surface’ after its inventor. In this chapter we discuss these topics and 
related ones. Here we use the term ‘analytic’ rather than ‘differentiable’, to emphasise 
the power series viewpoint — but you should remember that in the complex case these 
terms are synonymous. 


The Limitations of Power Series 


We illustrate the problem for the function 


(Y= : 
fOQ= Ta 


It would be possible to use a simpler example, such as 1/z, but it is more appropriate 
to work with something a little less special, where the general problem is more sharply 
defined 

This function is analytic in C \ {—1, 1}, and has simple poles at —1 and 1. Its power 
series expansion about z = 0 is 


Lee ag Ee ex (14.1) 


which converges for |z| < 1 but diverges for |z| > 1. Thus if we require functions to be 
defined on domains, the series (14.1) at best represents f on an open disc 


Dy ={zeEC: |z| < 1} 


Thus (14.1) tells us about a small part of f (directly, at least, although it has useful 
indirect implications, as we shall see). 
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We can begin to get round this problem by choosing a different zo, for example zg = i. 
To find the Taylor series for f(z) around zo = 1 it helps to rewrite f in the form 


1/1 1 
f= 5 (7 | =) 


Let w = z—1, so that z= w +1. Then 


1 1 | 1 
F@) 5 (; w+i1l—w -) 

-1 -1 
ag “) ! (4 “| 
2)1+i1 T+i 1-i 1-i 

1 am i w \” 1 = w “s 
= 505 4 2 (5) eae, 


Satan os 
~ 402 Li +i) “1-i\i-i ‘ 


The series (14.2) has radius of convergence ./2, because that is the distance from z = i 


to the nearest pole of f, so it converges on 
Do = {ze C: |z—i| < V2} 


Figure 14.1 shows that (14.2) converges for some values of z for which (14.1) converges, 
and conversely. Thus not only do (14.1) and (14.2) represent small parts of f (namely its 
restrictions to D,, D») but they represent different parts. 
Similarly if we take z9 = 2 we obtain a third series 
CO 
FO = Yo|-3C" + H-P"] @-B" (14.3) 


n=0 


convergent on the disc 


D3 = {zEC:|z—2| < 1} 


Figure 14.1 Discs of convergence of various local power series for f. Shading shows intersections 
of discs. 
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which represent yet another part of f. And something of this kind happens for any choice 
of zo: the Taylor series 


fO=> ae—ay (14.4) 
n=0 


converges for 
IZ — Zo] < K = min(|Zo — 1], |zo + 1) 


by Theorem 10.3. Thus no single choice of zp gives a power series expansion of f(z) 
valid for all z € C \ {—1,1}, even though f is analytic on this domain. (You should 
check that Laurent series do not improve the situation.) 

This problem is more a limitation of a tool — power series — than a defect of ana- 
lytic functions. It is no fault of f(z) that our clumsy attempt to represent it using power 
series has apparently failed. The fact that a power series converges on a disc, so often a 
help, becomes a hindrance when we look at functions defined on domains that are not 
discs. 

We could give up at this stage and ignore power series, but that would hardly be 
enterprising: it is a cardinal principle in mathematics not to throw away a good idea 
just because it does not work. If a single power series is no good, why not try a whole 
collection of power series? This solves part of the problem. We can certainly find, for 
each zo € C \ {—1, 1}, a power series expansion (14.4) around zo that converges to f(z) 
for all z near zo. Thus using several power series gives information about the whole of f. 

However, it also creates another, more serious, problem. In the discussion so far 
we started with f and worked out the power series. Weierstrass needed to proceed in 
the opposite direction: use power series to define f(z). In the example above we know 
that the series (14.1), (14.2), and (14.3) represent the same function f, because that is 
how they are constructed. If we are given two power series expansions around differ- 
ent points, how can we tell whether they represent the same analytic function, without 
knowing a priori what that function is? This is the central problem with Weierstrass’s 
approach. 


Comparing Power Series 


We wish to compare two power series 


p(@) = >> palz — 20)" 
n=0 


q@) = )> an(z — 21)" 
n=0 


around zo, Zz) respectively, convergent on open discs P, Q. Everything is easy if P and Q 
overlap (that is, P™ Q 4 @), as is the case for (14.1) and (14.2), or (14.2) and (14.3), 
see Figure 14.2. In this case we can prove: 
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Figure 14.3 With more singularities, chains of discs may have to be longer. 


LEMMA 14.1. Let f, g be analytic on a domain D. Suppose that P and Q are open sets 
and 6 #4 PM Q C D. Suppose that p(z) = q(z) for all z ¢ PQ, and f, g are analytic 
functions defined on D such that f(z) = p(z) for z € P and g(z) = q(z) for z € Q. Then 
f( = g(2) forall z € D. 


Proof. For z in the non-empty set PM Q, 


f(2) = p(@) = g(z) = a2) 


Now Theorem 10.16 implies that f(z) = g(z) for all z € D. 


In this sense, p and g represent the same analytic function. Obviously the converse 
also applies: if p and q represent the same analytic function on D then they must agree 
on the overlap. 

The existence of a non-empty overlap allows direct comparison of the two power 
series: all we have to do is look at their values. But what if the discs of convergence do 
not overlap, as for (14.1) and (14.3)? The answer is to construct a chain of overlapping 
discs from one to the other, on each of which the power series converges, and such 
that the power series agree on all overlaps. Thus D; and D2 overlap, and (14.1) agrees 
with (14.2) on D,; ND 2; and D> and D3 overlap, and (14.2) agrees with (14.3) on D2ND3. 

It is fairly clear that for f(z) = 1/(1—z) we can get from Dj to any point zo 4 +1 bya 
chain of at most three discs (two unless Zo is real and |zo > 1|). For a more complicated 
function with more poles (or other singularities) we may need more discs in a chain, 
because the discs have to ‘push between’ the singularities, see Figure 14.3. In a sense 
this is the reason for the limitation of power series: discs are too blunt an instrument to 
penetrate beyond the singularities into more distant territory. 
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When we formalise these ideas, as in the next section, it becomes clear that the restric- 
tion to power series is inessential. This is often the way in mathematics: the solution to a 
special problem turns out to apply in a much more general setting. The special problem 
plays the useful role of a psychological springboard, hurling our thoughts into higher 
realms. 


Analytic Continuation 


The simplest type of analytic continuation is direct analytic continuation, extending a 
function from one domain to an overlapping domain. Repeating this process we get 
indirect analytic continuation. We consider these in turn. 


Direct Analytic Continuation 


DEFINITION 14.2. If f, is analytic on a domain D, and fp is analytic on a domain Do, 
where D| N D2 ¢ @ and f(z) = fa(2 for all z € Dj N Do, then fp is a direct analytic 
continuation of f; to the domain D2, see Figure 14.4. 


Such an fy must be unique by Lemma 14.1. 
As a simpler example than that in the previous section, take 


A@M=Yiz" da<) 
n=0 
A@=Wa-y @EeC\{I}) 


Then /2 is a direct analytic continuation of f;. Whereas f; is defined only in the interior 
of the unit disc, f2 is defined on the whole of C except at 1. 

Before going on to the general case, a series of overlapping domains, it is worth 
dealing with another phenomenon. Sometimes D; may be such that f; has no direct 
continuation to any D2 not contained in D}. In this case the boundary of Dy is called 
a natural boundary for f;. Now the limitation does not stem from our choice of tools, 
but from intrinsic properties of f,: it so happens that D, is the end of the line as far as 
analytic continuation of f, is concerned. 

The standard example of this phenomenon is 


Figure 14.4 Direct analytic continuation. 
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jo=) 2 (14.5) 
n=0 


which converges (so is analytic) for |z| < 1. We prove: 


PROPOSITION 14.3. The unit circle S = {z € C: |z| = 1} is a natural boundary for 
the function f defined by (14.5). 


Proof. Let 2 = elP/4 for p,q, € Z,q > 1. Then Za = | for alln > gq. We show first that 
as Z —> zo then f(z) > oo. Let z = rzg where 0 < r < 1. Then 


f@) = > (r20)" 
n=0 


[o,2) 

= (Lt reg too fr DEED > i 
n=q 

= g(r) + h(n), say. 


Note that h(r) has the above form because Z = 1 for all n > q. For any fixed integer 
N>0, 

q+N 

Yor sN+1 asr > 1 


n=q 


So for some € > 0,if 1— ¢ <r < 1, then 


Then 


so that h(r) > oo asr > 1, and h(r) > oo as z > zg. On the other hand, 
a) =1Lt ete ttre ase x 
Therefore 


lim f(z) = co (14.6) 
Z—>Z0 


Now, suppose that F is a direct analytic continuation of f to a domain D2 not contained 
in D,. Then dD; NM D2 is open in dD; = S, so it contains some point z7 = e?/4 for 
D,q,€ Z,q => 1 since such points are clearly dense on the unit circle. There is a small 
disc D, round zg such that D, C D2. Now for 1 > r > 1 — € we have rz € D, , so that 


F(rzo) = f(rz0) 
Asr— 1, 
F(rzy) > 00 


by (14.6). But F is analytic in D2, so 
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Figure 14.6 Sequence of overlapping domains. 


F(rz0) — Fo) 


as r — 1, by continuity. This is a contradiction; therefore no direct analytic continuation 


of f to D2, not contained in Dj, is possible. 


Figure 14.5 illustrates the geometry of this proof. Informally, the point is that the 
singularities e!”/% are so close together that there is no room to squeeze a disc between 
them, onto which f might be analytically continued. 


Indirect Analytic Continuation 


We now return to the question of performing a sequence of direct analytic continuations, 
one after the other. 


DEFINITION 14.4. Let Dj, ...,D, be domains such that 
D;OADr4, #0 (r=1,...,n—-1) 


as in Figure 14.6. If (f;) is a sequence of analytic functions defined on D, such that 
f+. is a direct analytic continuation of f, (r = 1,...,n — 1), then f,, is an analytic 
continuation of f,; from D, to D,,. 


An analytic continuation that is not direct is an indirect analytic continuation. 


Unlike the direct case, we may obtain different results by using different sequences of 
domains, Figure 14.7. Indeed, we can even have a sequence of overlapping domains for 
which D, = Dj, but f, 4 fi, so that on returning to our starting point we end up with 
a different function from the original one. We illustrate this possibility by an example 
in the next section. But first, we apply the above ideas to define a broader concept of an 
analytic function. 
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Figure 14.7 Sequences of overlapping domains that may lead to different analytic continuations. 


Complete Analytic Functions 


If f is analytic in a domain D we call the pair (f, D) a function element. We define a 
relation ~ on the set of all function elements by 


(fi,D1) ~ (f2, D2) 


if f2 is an analytic continuation of f; from D, to Dz. Since indirect continuations are 
allowed, it follows easily that ~ is an equivalence relation. A complete analytic function 
is an equivalence class under ~ of function elements. 

In other words, a complete analytic function in the new sense is an analytic function in 
the old sense, together with all of its analytic continuations. It is clearly more convenient 
to have the analytic continuations ‘built in’ than to have to fit them together as the 
occasion warrants. 

If there exist function elements (f1, D; and (f2, D2) of the complete analytic function 
F such that for some z € D, M D2 we have f1(z) 4 fo(z), we say that F is multiform. If 
not, then F is uniform. All the examples considered so far in this chapter are uniform, 
but we give examples of multiform functions in the next section. A multiform F is a 
formal version of the classical idea of a ‘multivalued function’, with the advantage that 
it is broken down into pieces on each of which it is a genuine single-valued function. 
The multiformity arises because of how these pieces fit together. A geometric approach 
to this leads to the concept of a Riemann surface, discussed informally in Section 14.5. 


Multiform Functions 


A simple instance of a multiform function is 


{Da=s% 


If z = re’? then we can choose for ./z either 


VJre8l2 op reid /2+7 
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where ./r is real and positive. With the old concept of an analytic function, we must 
choose one of these arbitrarily, and then f(z) is analytic only if we make a cut in the 
complex plane. From the present viewpoint, we can do better. We spell the method out 
in considerable detail — just this once. 
We introduce four domains: 

Ay = {ze C:rez> 0} 

Hy = {ze C:imz > 0} 

A3 = {ze C:rez <0} 

Ag = {ze C:imz < O} 
which are the open half-planes to the right, top, left, and bottom of the complex plane. 
Let z = re!’ where r > 0,0 € [—, 1]. Define 


AQ = J/re'9/2 for zeD = 
Alida = a/rel?l@ for z€D, =H 
/rei?/2 for z € D3 = H3,imz > 0 
BD = Jreil0/2+m) for zéD3 =H3,imz <0 
f(a = JreiO/2+7) for zED,a=M 
BQ) = re for 2 Ds = 
fl = Srei@/2+n) for z€ De = Ay 
fe & "eal for z¢€D7=4H3,imz>0 
Jre'?/? for z¢€D7=4H3,imz <0 
RD = free! for z€Ds=Hy4 


Each of these eight functions is analytic in its domain of definition; this is why for f 
and f7 we have to change our choice of ,/z as we cross the imaginary axis, because 
otherwise our choice —z < 6 < a would introduce a discontinuity. Further, f-+1 is a 
direct analytic continuation of f- (r = 1,...,7) and f| is a direct analytic continuation 
of fg. For each z € C \ {0} the values f(z), where defined, are one or other of the two 
possible square roots. Further, for each r, the function f(z) takes one of the two values 
and f;+4 takes the other value, interpreting r+ 4 modulo 8 (and replacing 0 by 8). 

Thus we have defined a multiform function, taking two values at each z # 0. Going 
round the origin once, from D; to D2 to D3 to D4 to Ds = Dj, we reach a different value 
of f(z) from the original one. However, in this case, going round a second time returns 
us to the original value. This last effect is a special feature of ,/z, as the next example 
shows. 


The Logarithm as a Multiform Function 


Example 14.5. An immensely important multiform function is 


F(2) = logz 


Its multiformity was Euler’s great discovery, arising from the Bermoulli—Leibniz 
controversy mentioned in Chapter 0. In terms of the present discussion, we introduce 
domains 
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Dar = Hy (keZ,r= 1, 2,3, 4) 
with the H;, as above. To avoid the kind of two-piece definition that occurred for f3 and 
f7 above, we proceed as follows. For z € D, and z = re” where 
n—2 
2 


n 
n<O0< —0 
2 
we define 
fA = logr+i0 
Then f,, is analytic on D,. On Dj, 
fi@) = Logz 
the principal value of the logarithm. On Ds, 
fs(z) = Logz + 271 

On Dak+1, 

fart (z) = Log z+ 2kri 


It is not hard to check that f,, is a direct analytic continuation of f, from D,,, to D, for 
all s € Z. For each r = 1,2,3,4 andk ¢€ Z the values of f4,,,(z) (z € H,) give all the 
infinitely many possible values 


log |z| + Qka + argz)i 


of the logarithm. 


Singularities 


We can now give a general definition of a singularity. If no analytic continuation of f can 
be defined at a point zo then we say that zo is a singularity of the corresponding com- 
plete analytic function. We have met several kinds of singularity before: poles, isolated 
essential singularities, natural boundaries. Removable singularities are not singularities 
at all according to the new definition, because ‘filling in’ the missing point is a form of 
analytic continuation. 

For ./z and logz we encounter a new kind of singularity, called a branch point. 
Analytic continuation round such a point leads to a change in value. Notice that for 
f(z) = ./z there is even a natural definition of f(z) at the branch point itself, namely 0, 
but f is not analytic there. This is not the case for the logarithm. 

Multiform functions make an appearance in contour integration whenever different 
choices of path from zo to z; leads to different values of the integral 


21 
F(z) =| f(2dz 
40) 


(as may occur, for instance, if f has a pole in the region between the two paths). Hence 
quite nice singularities of f, such as poles, give rise to much nastier singularities of F, 
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namely branch points. This can occur even when f is uniform: for instance, 1/z is uni- 
form but its integral log z is multiform, and the pole of f at 0 has become a branch point 
of F. 


Riemann Surfaces 


Riemann was a great geometer, and he invented a geometric way to envisage multiform 
functions, which is much more intuitive than an equivalence class of function elements. 
His idea is to replace C by a more complicated ‘Riemann surface’. Roughly speaking, 
the idea is to ‘glue together’ the domains of function elements at overlaps where the 
functions agree. We sketch this idea in general later; our immediate aim is to describe a 
few key examples. 


Riemann Surface for the Logarithm 


In the case of the logarithm we can describe the construction informally in the following 
terms, which should not be subjected to too deep scrutiny of a logic-chopping kind. We 
are not attempting a rigorous definition at this point; the informal description, though it 
may sound far-fetched, is in fact capable of being given a rigorous rendering, described 
at the end of this section. 

Consider a collection of copies Cx of C, one for each k € Z. Slit each Cx along the 
negative real axis from 0 to —oo. For each k join the top left-hand quadrant of Cx to 
the bottom left-hand quadrant of C;,, along the slit. The resulting surface, Figure 14.8, 
resembles a spiral staircase or a concertina, with the planes stacked on top of each other 
in order relative to k, and with a continuous spiral path from any Cx to any other C; 
going up the ‘steps’ where the slits were joined. These planes Cx are the sheets of the 
Riemann surface, and it is convenient to imagine the whole collection to be stacked on 
top of C as shown in Figure 14.8. 

We define the logarithm for points on the Riemann surface by 


log |z| + (2ka + argz)i 


anaes 


Figure 14.8 Riemann surface for the logarithm. 
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Figure 14.9 A chain of overlapping domains that closes up in C no longer closes up when it is 
transferred to the Riemann surface. 


fara) freee: 
Figure 14.10 Riemann surface for the square root — schematic of how the two sheets join. 


where Z is the point of Cx lying directly above the point z € C. This is a single-valued 
function on the Riemann surface, and it is continuous in the sense that the values join 
up correctly across the cuts. (In a similar sense we can even say it is differentiable: the 
derivatives also join up correctly across the cuts.) 

Now we describe how to relate analytic continuation in C to the corresponding opera- 
tion on the Riemann surface. Corresponding to a closed chain of domains D),...,D, = 
D, <C, we have Di pe ,D', on the Riemann surface. In C the values of the functions 
fi,--->fn agree on the relevant overlaps Di 1 D2,...,Dn—1 \ Dn, but because of the 
multiformity they do not agree on D, and D,. 

In contrast, on the Riemann surface we find that if we make dD and D, overlap, then 
D‘, and D},, and so on, then by the time we get to D!, it no longer coincides with Dj 
because we have moved one level up the Riemann surface, see Figure 14.9. Hence Di 
and D’, no longer overlap, and the difference between f; and f, is only to be expected. 
The multiform nature of analytic continuation is automatically taken care of by the mul- 
tiplicity of the sheets of the Riemann surface and how they join together. Even the nature 
of the singularity at 0 is apparent from the geometry of the surface, because here all the 
levels meet. 


Riemann Surface for the Square Root 


Similarly for ./z we obtain the Riemann surface by taking two copies C; and C2 of 
C, slit from 0 to —oo. Then join the top left-hand quadrant of C; to the bottom left- 
hand quadrant of C, along the slit; and similarly join the top left-hand quadrant of C2 
to the bottom left-hand quadrant of C,. (To do this in three-dimensional space requires 
allowing the surface to intersect itself, but there is no conceptual problem here, see 
Figure 14.10.) 

Again the phenomena noted above are clear from the geometry of the Riemann 
surface: the two values of f(z), the fact that going round the the branch point at the 
origin once changes the value but going round twice does not. 
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4/ 4 


Figure 14.11 Gluing successive domains to form the Riemann surface of the logarithm. 


Constructing a General Riemann Surface by Gluing 


Obviously we cannot rely on such ad hoc methods to construct the Riemann surface 
in general. We can approach the general method through the same example of the 
logarithm, but building up the surface in a different way. Consider the sequence of 
half-planes D, and functions f; used for analytic continuation above. Suppose we take 
half-planes D‘, corresponding to the Ds, but make them pairwise disjoint. (The D, are not 
disjoint; for instance Dj = Ds and so on.) Then we ‘glue’ the D‘, together in the follow- 
ing way: D, and D,,, overlap on a quadrant in which f, = f;+1, So we glue together the 
corresponding quadrants Dj and D’_, ; where they overlap. Then D4, glues on top of Dj at 
the top right-hand quadrant; D; glues on to the top left of D/,; D, glues on to the bottom 
left of D; and Dg glues on to the bottom right of Dj. at this point Dé is lying directly 
over D', but we do not glue them together since f| and fs are different, Figure 14.11. 
Continuing with Dj, D4, ... (and in the downward direction Do, D'_,, D_,,...) we build 
up the spiral staircase Riemann surface again. 

The general method for constructing a Riemann surface of a complete analytic func- 
tion F follows the same lines. Recall that F is an equivalence class of pairs (f, D) where 
f is analytic on a domain D and (f1,Di) ~ (f2, D2) if fi,f2 are equal on the non-empty 
set D; 1 Dj. We take disjoint copies D‘, of all the domains D,, occurring in the pairs 
(fi,,D)) that belong to F. If D, 1 Dy # Y and fi, = f, on it, we glue Dj to Dj, at the 
points corresponding to the overlap D, N Dy. 

We make this description more rigorous. First, the question of ‘disjoint copies’ D), of 
D,,. One way to achieve this is to define 


Di, = Dj x {A} 
Then if A A pw it is clear that D) 1 D/, = Y, and there is a natural bijection 
Jan 2 Dy > D, 
defined by 
Jals) = (8,4) (s € Dy) 


Now for that ‘gluing’. This is accomplished by a simple trick, using yet another equiva- 
lence relation, but it requires a certain amount of sophistication to appreciate that it does 
what is required. 
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Let 
(2,,A) © (Zp) 
for z, € Dy, Zy € Dy, if: 


(i) ZA =2y 


Gi) fi) =fu Gp) 


Then the set of equivalence classes under ~ of points in the union of all the D{ is 
defined to be the Riemann surface of F. The equivalence relation ~* acts as the glue. 
This definition is quite elegant, but might not be immediately appealing. 

The advantage of the Riemann surface construction is that we can now think of a 
multiform complex analytic function as a genuine function, defined not on C but on the 
appropriate Riemann surface, rather than as an equivalence class of function elements. 
The Riemann surface gives a ‘global’ view of the function, instead of chopping it up into 
‘local’ pieces. The last few sections of this chapter revisit previously discussed material 
to explore some of the insights that can be gained in this way. 


Complex Powers 


So far we have considered powers z“ of a complex number z only for rational a, where 
2/4 is a qth root of 2”. We now define it for arbitrary a € C. We would like to do this so 
that the laws 22+? = zz) and (2%)? = 2”, (zw)* = zw remain valid. It turns out that 
we can ‘almost’ do this: however, z“ is in general multiform, and the formulas hold only 
if we choose appropriate values. Even when ais rational this problem already arises. 

A natural approach is to write 


z= re? = elogr-+id (14.7) 


where logr € R, and let a = a +1. On the assumption that the above laws hold, we 
obtain: 

ft — (re? 4 

= eillogr-+18) 


= e(@+ip)(og r+i0) 


— e& log r—B0 ei log r+a0) (14.8) 


We therefore define 2“ by (14.8). This makes sense for all z 4 0, and for all a € C. It 
amounts to requiring that 


fae (14.9) 


which is an equally natural way to define z*. It follows that in any domain for which we 
may define a unique branch of log (such as the cut plane C, for any p) we may also 
define z* as a single-valued differentiable function. 

In general, however, since log is multiform, so is z*. To see what the possibilities are, 
choose a particular value 09 for @ (the obvious choice is the principal value of arg z). 
Then the possible 6 that make (14.7) hold are of the form 
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06=%+2nn (nEZ) (14.10) 


Write 
(2")n = eilog r+i(69+2nz)) 
which is the ‘nth branch’ of 2’, obtained by substituting 6 from (14.10) into (14.8). The 
question is now: how does (z“),, depend on n? 
From (14.8) we have 


Cy Serr ee 5 (14.11) 


where (z“)g is the Oth branch. Using (14.11) we can answer the above question. There 
are three cases: 


(a) If 8 € 0 then z* has infinitely many values, one for each n, because 
Keen) Sie ee 


which takes distinct values for distinct n. The Riemann surface for z“ in this case 
has the same form as that for the logarithm: an infinite ‘spiral staircase’ on whose 
nth layer the function is given by the branch (z“)n. 

(b) If 8B 4 0,soa =a € R, and if further a is irrational, then z“ = 2° is again infinitely 
many-valued, with distinct branch values for distinct n. For if (2%)m = (2*)n then 
eeMTia — g2nTia so e27i(m—na — |, This implies that (m—n)a € Z by Section 5.7, 
which implies that m = n since a is irrational. The Riemann surface for z® is the 
same as for case (a), but unlike case (a), the moduli of distinct branches are equal. 
Only the arguments vary. 

(c) If 8 £0,soa =a ER, and if further o@ is rational, then we can write a = p/q in 
lowest terms, where p,g € Z. Now 2 = J/@. 


Following the discussion in (b), two branches give the same values if and only if 


(m—n)- eZ 
q 


which happens if and only if g divides m — n, because p and g have no common factor 
greater than 1. 
It follows that the value of (z*), depends only on n mod (q). The branches 


(zo, (2*)1, ee) (2*)g-1 
are distinct, but the gth branch repeats: 
(27 )q = (2")o 


and repetitions continue thereafter. 

The Riemann surface, of course, is a g-sheeted spiral, with its top sheet glued to its 
bottom one, as in our discussion of z!/* but with q sheets instead of 2. Figure 14.12 
illustrates the case g = 5. 
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Figure 14.12 Section through the Riemann surface for zP/4 when q = 5, showing how the layers 
glue together. 


Figure 14.13 Conformal map of a half-plane to a wedge. 


yryyyyy - Av tee 


Figure 14.14 How the conformal map of a half-plane to a wedge transforms streamlines. 


Conformal Maps Using Multiform Functions 


Conformal mapping with multiform functions requires careful attention to how values 
are specified; again, the simplest way to keep track is to put everything on a Riemann 
surface. Here we consider only one case, of practical importance: the map z > 2% for 
real wa > 0. We assume the domain is the cut plane C,,, on which the map may be defined 
to be single-valued. 

The great virtue of this map is that it transforms a half-plane into a wedge, as in 
Figure 14.13. The vertex angle of the wedge is az, so we have to assume a < 2 to 
prevent the image overlapping itself. The map is conformal except at the origin. 

For example, uniform fluid flow, with streamlines re z = constant, transforms to the 
flow round a corner of angle a7, as in Figure 14.14. 

If we take a = 1/2, the corner is right-angled. Then z = x-++iy and z!/2 = w =u+iv. 
The flow-lines are x = constant. Now z = w* so xx = u*—v*, y = 2uv. The streamlines in 
2 


the w-plane are given by u’—y* = constant; they are branches of rectangular hyperbolas. 


14.8 


14.8 Contour Integration of Multiform Functions 305 


Figure 14.15 Transferring a contour to a Riemann surface. 


In combination with other ‘standard’ conformal maps, z* provides a useful weapon 
for the applied mathematician. 


Contour Integration of Multiform Functions 


It is conventional to interpret a contour integral i, f of a multiform f by choosing the 
values of f(z) so that they vary continuously along the contour. (This still leaves an 
arbitrary initial choice to be made, which must also be specified.) A more civilised 
approach is to break the contour y into a sum y = y, +---+ y, such that: 


(i) Each y; lies inside a domain Dj; on which f may be defined as a single-valued 
function. 

(ii) On each D; we choose a branch of f that makes the values agree where y; and y;+1 
join. 

In terms of Riemann surfaces, we can interpret this process as the definition of a 
contour integral when the contour lies on the Riemann surface. The way the contour 
wanders up and down the staircases on the surface automatically takes care of the choice 
of values of the multiform function f(z), see Figure 14.15. In practice this is all more 
straightforward than it may sound, and the computations are no harder than the uniform 
case — provided you use your head. 


Example 14.6. Let y be the contour 


y(t) =(1+4+ne" (t € [0,67]) 


/ Maz 
y 


Method 1 (stupid): Defining z!/> on the contour by (y(t))'/5 = (1 + 1£)!/5e4/> makes it 
vary continuously with t. By Theorem 6.6 


Find 
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Table 14.1 Choice of 0. 


domain 6 chosen in interval 
Dy, [-2 /4, 52/4] 

D2 [32 /4, 97 /4] 

D3 [7x /4, 1377/4] 

D4 [112 /4, 1727/4] 

Ds [157 /4, 2127/4] 

D6 [19 /4, 2577/4] 


Figure 14.16 Six overlapping domains. 


60 


i f(z)dz = f (f(y@)y'O dt 
y 
67 
= y (+ )!/5e!/5 Gel + ite” + el”) dt 
0 


which leads the hopeful calculator along some fascinating byways inhabited by integrals 
such as 


6 
/ 1(1 +1)! cos(6t/5)dt 
0 
A wise person knows when to cut their losses. 
Method 2 (sound but pedestrian): Cover the contour y by domains in which z!/° may 
be given a unique value for each z, which is differentiable. For example, we can find 
six domains D,,...,D6, one for each of the intervals AB, BC, CD, DE, EF, FG in 
Figure 14.16. 

To make the values match on overlaps we choose the definition of z 
Write z = re’ where the choice of @ is as in Table 14.1. 

On each domain choose the branch of z!/> given by r!/5ei#/5 
in the table. These choices make 6 agree on overlaps. 


1/5 as follows. 


with 6 in the range stated 


14.8 Contour Integration of Multiform Functions 307 


Break up the integral as 


ee 


In each domain Dj the function z!/5 has an antiderivative a 5 where z°/> is defined 
using the corresponding branch 2/5 = 6/9 Q618/9 | By Theorem 6.33, if PQ is any of the 
arcs AB, BC, and so on, then 


i AP dy 5.6/5 _ 5 p6/5 
PQ 


When we add the six expressions of this type, all terms but two cancel, so the integral is 
56/5 __ 56/5 
go — gA 


Here A = y(0) =1,G =y(6z7) =(1 + 67 e!* . Evaluate the 6/5th powers according 
to the above prescription: 


A9/5 =(1- @i0)6/5 =] 


Therefore the integral is equal to 


— 311 + 6) ei*/5 + 1] 


Method 3 (slicker): In terms of the parameter ¢ we can define the required branch of zs 


to vary continuously along the contour y if we put 
zi/5 = (eu) ae — ( + t)'/Seit/5 


There is a local antiderivative ac 5, whose value varies continuously along the path y 
if we choose branches so that 


(D5 = + 99/5eSt/5 


Now pave y by a finite number of domains, on each of which zis may be rendered 
single-valued, choosing the branches to agree with those already chosen on the path. As 
above, the integral may be expanded as a sum of terms of the form 


ae + ty 1 )/ biti /5 (i+ 1;)9/5 6it/5] 


where the t; subdivide the interval over which ¢ runs. All but two terms cancel, leaving 
the same answer as before. 

The advantage of this method is that by choosing branches in an obvious way for 
y(t) we may leave the prescription of the domains, and the branches on those domains, 
implicit. 


Method 4 (smart): Let the Riemann surface do the work. The above analysis easily 
generalises to give a version of Theorem 6.33 for multiform functions, integrated along 


308 


Analytic Continuation 


a contour in the Riemann surface. Given a global antiderivative F for f on the Riemann 
surface, we have 


if f= Fe) —F@) 
¥ 


where zo is the initial point, z; the final point, and the branches are chosen with reference 
to the end points of y on the surface, as in Figure 14.15. This gives the result almost 
immediately on noting that y winds three times anticlockwise, so the 6/5th power winds 
3-6/5 = 18/5 times anticlockwise, giving for F(z1) the argument 27 - 18/5 — 367/5, 
if we choose argument 0 for Zo. 


We leave it as an exercise to formulate and prove generalisations of Cauchy’s Theo- 
rem for contour integrals on the Riemann surface of a multiform function. For those who 
prefer not to use the insight that Riemann surfaces provide, however, we recommend 
Method 3 above as a relatively direct and simple technique. 


Example 14.7. Example 14.6 is of course a bit artificial, and its main interest is ped- 
agogical. A more typical example of integrals likely to be encountered in practice 
is 
e,@) xe 
| ——dx (a€R,0<a<1) 

o l+x 
Note that the restrictions on a make this integral converge, so the question is sensible. 
We begin with the complex function 


fg =404+2>! 


which is differentiable for all z 4 0,i, —i, and is multiform for all a in the given range. 

Make a cut along the positive real axis and work in the cut plane Co (in the notation 
of Section 2.2). On this domain z“ may be rendered single-valued: set (re'?)¢ = r“e%° 
where 6 € [0, 27]. Now integrate f(z) along the contour y in Figure 14.17. This runs 
from the real point p to the real point R, then once anticlockwise round the circle of 
radius R, then back to p, then once clockwise round the circle radius p. 


Figure 14.17 Contour for Example 14.7. 
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By Cauchy’s Residue Theorem, Theorem 12.3, 
[ pow = 27i(> residues inside y) 
y 


We intend to let o — 0 and R —> oo, so we may as well assume that op < 1 andR > 1. 
Then the singularities inside y are z = ++i. We calculate the residues at these points as 
follows: 

At z =i the residue is 


| oiax/2 


a =limz’?/(z+i)= 
zi 271 


At z = —i the residue is 
1. 
= ji a 3) — ___e@ia3n/2 
ES eee = pe 


Let the circle of radius R be y1, and that of radius p be y2. Then Cauchy’s Residue 
Theorem implies that 


R x4 za p (xe271)4 aa 
dx-+ dz dx-4 dz = 277i 14.12 
/ 1 +x? . ; 1+22 [ 1+ x2 p I. 1+2 6 A BN ) 


Note that in the third integral we must choose the branch of z% corresponding to argu- 


ment 277, to retain continuity. Now let 9 — 0 and R > oo. Easy estimates show that 
the second and fourth integrals tend to zero. The first tends to 


Co ya 
if ds 
(0) 1 + x2 


and the third to 


Therefore 


2nia / oo xt ‘ 1 ian /2 1 ia3n/2 
(d-e"") dx = 271 -e =e 
9 1l+x ni 


After some manipulation, this leads to 


oe Ww 
/ 5 dx = 
o I+x 2 cos(az /2) 


Yes, but... did you spot the nifty footwork? The chosen contour does not actually 
lie inside the domain Cy. Nonetheless, the result is correct. There are several ways to 
justify the answer rigorously. They include: 


(1) Replace y by a contour that starts just above p, say at o + ie, runs horizontally to 
just above R at R + ie, circles y; to just below R at R — ie, runs back to below 
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Figure 14.18 Modified contour remains inside Co. 


Figure 14.19 Lifting y to the Riemann surface. 


p at p — ie, and the goes back round yz as in Figure 14.18. Now, as p — 0 
and R — oo, make the width 2¢ of the channel tend to zero and use continuity 
arguments. 

(2) The choice of Co causes problems, so reject it. Work in two overlapping domains, 
say C_,/4 and C,,/4, and switch between them when defining the second, third, and 
fourth integrals in (14.12). 

(3) Put it all on a Riemann surface. The contour y then winds round the staircase and 
up one step, then winds back down one step to get back to the start. 


Everything generalises to such contours, because they are obviously boundaries of 
rectangles in the Riemann surface, Figure 14.19. This argument is an example of the 
‘homology’ version of Cauchy’s Theorem, when generalised to Riemann surfaces. We 
discuss homology in Chapter 16, but do not extend the ideas to Riemann surfaces since 
this would take us too far afield. 

These three approaches justify the computational method rigorously. Of course, when 
performing the calculations, it is not necessary to trot out such a justification every time 
— provided you understand how it would go. Everything works fine as long as you follow 
the golden rule: make the multiform function vary continuously as you walk round the 
contour. 


14.9 Exercises 311 


Exercises 


1. Define three power series by 


W=lteepeteo= ye 
n=0 
Wz) =i-(@-i-N—-i@-i-1)? +@-i-D?+--= VPM E-i- vy" 
n=0 
oz) = -1+(¢-2) -—@ 2 +2 = Oe - 2)" 


n=0 
Find their discs of convergence and sketch them. Prove that a(z) = b(z) on the 
overlap of their discs of convergence, and similarly that b(z) = c(z) on the overlap 
of their discs of convergence. Do the discs of convergence of a(z) and c(z) intersect? 
. Let f(z) = Ore an(z — 21)” and 9(z) = OP o balz — 22)" be two power series, 
and assume that their discs of convergence have non-empty intersection. Prove that 
g is a direct analytic continuation of f if and only if there exists zo in both discs of 
convergence such that form = 0,1,2,3,... the equations 
m 
Yin —1--@—m)lan(eo — 21)" — ba(zo — 22)" ] = 0 
n=0 


hold. 


. A certain function has singularities at precisely the points z such that rez and imz 
are integers. You are given its Taylor expansion around the point + ai. What is 
the smallest number of stages needed to find an indirect analytic continuation, by 
power series, to a domain that contains b + 3i? 


. Show that the functions defined by 


@ fg =1l4ezt+ 24774 Be--- 42" 4... 
Gi) gi) =14z4+ 24274 274.-.-42"% 4... 


have natural boundaries at |z| = 1. 


. Suppose that f(z) = }° anz” has radius of convergence 1. Set 
bid 2 
Z= — =w-wtw Wwetess 
l+w 
and transform f(z) to a power series in w, say F(w) = )¢ b,w". Prove that this latter 
power series has radius of convergence > 5; and that if —1 is a singular point of f, 


the radius of convergence is exactly a 


. Show that the series 

CO 

Sita) alae 1 
n=1 


converges when |z| > | or |z| < 1, but that the two functions so represented are not 
analytic continuations of each other. 


312 Analytic Continuation 


7. Suppose that f and g are defined, and have no singularities, on the whole of C. 


10. 


11. 


12. 


Define 


oo 
1—z" 1—zl 
99 =D (15 =) 


n=1 


Show that 
1 f(z) + 9(z)) + 4 P(e F(Z) — 92) 


is equal to f(z) when |z| < 1, but to g(z) when |z| > 1. 


. Let f(z) be the (multiform) function z,/z. Show that at z = 0 there exists a first 


derivative that is the same for all branches, but a finite second derivative does not 
exist. What about z” log z? 


. Describe the Riemann surfaces of the multiform functions: 


i) /z+ 4 
Gi) JI- 3 


(iii) cos~! z 
(iv) tan! z 


Describe the Riemann surface of 
[ie — Iz — 2) 71" + (2-3)? 


Let w € C. Show that the values of 1° form a subgroup U,, of the multiplicative 
group of non-zero complex numbers. Show that U,, is cyclic of order q if w is a 
rational number p/q in lowest terms; otherwise, U,, is infinite cyclic. 

Show that U, is contained in the unit circle if and only if w is real; lies on the 
positive real axis if and only if re w is an integer; and otherwise lies on a logarithmic 
spiral parametrised by ¢ € R in the form e® for some fixed complex number a. 


Show that the function 
f@=e™BVz =i 


defines a conformal map from the domain in Figure 14.20 (left) to the upper half- 
plane. 


ah 


Figure 14.20 Left: Domain for Exercise 12. Right: Domain for Exercise 13. 
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Figure 14.21 Sector occurring in Exercise 14. 


Figure 14.22 Contour for Exercise 16. 


13. 


14. 


15. 
16. 


17. 


18. 


Show that the function 
2/3 
_ -in/3 ( ~ 1 
f@ =e eS *) 


defines a conformal map from the domain in Figure 14.20 (right) to the upper half- 
plane. 


Find the image of the sector —z/n < argz < a/n, |z| < 1 (Figure 14.21) under 
the conformal transformation 

f@=2itzy" 
where nis a positive integer. 
Let y(t) =e", (t € [0, 477]). Calculate dy ,/zdz where at t = 0 we take JI =1. 
Using the contour in Figure 14.22 show that 


lo) xk 
i dx = mcoseckz (0 < k <1) 
o Il+x 


Show by contour integration that 


/ = “_ (a>0) 
— a> 
0 (ltax2)JI—x2 2/1 +a 


Using the contour of Exercise 16, show that 


eo xe m(1 —a) 
dx = (-1 <a< 3) 
o U+x) 4cos ha 
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Figure 14.23 Schwartz Reflection Principle. 


19. Let y(t) = (1+ 4cost+isins)>, (¢ € [0,27]). Find 
7 2” logz — J/zt(z — 5)! !dz 
Y 


20. Describe the Riemann surface of the function 


fQaVzt2ZtAt¢Bter topes 


21. Schwartz Reflection Principle. Let U be a domain in C that is symmetric about the 
real axis (that is, if z ¢ U then z € U). Let 


Ut ={zeU:imz> 0} 
U- ={ze U:imz < 0} 
Uo ={zeU:imz=0} 
as in Figure 14.23. Suppose that 
fer uUry Sec 
is continuous, analytic on U*, and takes real values for z € U ° Then there is an 


analytic function F : U > C such that F(z) = f(z) for all z € Ut UU®. 
(Hint: define F(z) = f(@) for z € U~, and use Morera’s Theorem.) 


15 


Infinitesimals in Real and Complex 
Analysis 


Historically, calculus began with the study of variable quantities represented as curves 
in the plane. These variables could become ‘arbitrarily small’ or ‘infinitesimal’. They 
were used to compute derivatives and integrals, and to formulate differential equa- 
tions modelling systems that change dynamically. However, the precise meaning of an 
infinitesimal caused controversy. At least two different concepts existed: a ‘number’ (of 
some kind) that is smaller than any non-zero number, or a variable number that can 
become smaller than any positive number. The two were often confused. Moreover, it 
is not easy to formalise the idea of a ‘variable number’. (If it varies, how can it be a 
number?) 

Because of these difficulties, infinitesimals, in either sense, were eventually dis- 
placed by the logical epsilon-delta formulation of Weierstrass in the late nineteenth 
century. The real numbers form a ‘complete ordered field’, and therefore do not contain 
infinitesimals, so infinitesimals were banished from formal mathematical analysis. 

Meanwhile, applied mathematicians and theoretical physicists continued to model the 
real world using ‘arbitrarily small’ quantities. The familiar formula T = 27./L/¢ for the 
period of a pendulum, derived using the approximation of simple harmonic motion, is a 
familiar example. Usually described as ‘valid for small amplitude swings’, this formula 
is actually incorrect for any non-zero amplitude. But is is also a very good approxima- 
tion for sufficiently small amplitudes, and more accurate formulas cannot be derived 
or expressed using elementary functions. This procedure is typically justified by let- 
ting the approximations concerned become arbitrarily small — the main traditional view 
of an infinitesimal. Thus two parallel but apparently conflicting approaches to calculus 
arose: one with, and one without, infinitesimals. Moreover, the tacit assumptions about 
infinitesimals applied to two different concepts. 

In this chapter we reconcile these views, offering a formal approach to infinitesimals 
that lets them be manipulated algebraically. Real infinitesimals can be visualised on an 
extended number line, and their complex analogues can be visualised in an extended 
complex plane. Infinitesimals are made visible to the human eye by a process of mag- 
nification, which is defined algebraically and has a natural visual interpretation. The 
formal representation includes the idea of an infinitesimal quantity as a process that 
tends to zero, and also as an element in an ordered field extension K of the reals, and 
in the corresponding extension of C. We prove a simple structure theorem for such an 
extended system that any finite element x in the extended system (meaning |x| < r for 
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Figure 15.1 Leibniz’s view of dx, dy as components of the tangent. 


some real number r) is uniquely of the form x+ h, where x is a real or complex number 
and h is infinitesimal or zero. 

This approach provides a formal theory that encompasses both the standard epsilon- 
delta approach to analysis in pure mathematics, and the ‘applied’ approach in which 
infinitesimals are thought of as variables that tend to zero. 


Infinitesimals 


Leibniz, Newton, and their contemporaries used various imaginative devices to deal 
with differentiation and integration. Leibniz’s first publication on calculus specified the 
derivative dy/dx as the quotient of two finite numbers dy and dx. Figure 15.1 illustrates 
his procedure. Suppose that the tangent to a point P on the curve mets the horizontal 
axis at B. For any finite horizontal displacement dx, let dy be the corresponding vertical 
change of the tangent. Then the ratio dy/dx is the same as y/BX. 

In this interpretation, dx and dy do not represent infinitesimals. They represent finite 
components of the tangent vector. When the slope of the tangent is m, we can take any 
finite number dx to be the horizontal component, and the vertical component is then 


dy = mdx 


The slope of the tangent to the curve at P is the quotient dy/dx. 

The devil lies in the detail. The definition makes sense only when the slope of the 
tangent is known, and to do that, it must be calculated. This is where infinitesimals 
came in. Leibniz performed the calculation by imagining the graph to be a polygon 
with an infinite number of infinitesimally small sides. The tangent at a point was the 
prolongation of one of these sides, as in Figure 15.2, where the lengths of the sides are 
imagined to be arbitrarily small. 

He also conceived of different degrees of infinitely small or infinitely large quantities, 
by comparing the size of a ball to that of the Earth, or to the distance to the fixed stars. 
This led him to imagine infinitesimals of different orders of size. An infinitesimal of 
second order is infinitely smaller than an infinitesimal of first order, and so on. The 
reciprocal of an infinitesimal of a given order is an infinite element of the same order. 
For instance, if v is a first order infinitesimal, then v? is a second order infinitesimal and 
1/ v’ is an infinite element of second order. Leibniz’s ideas were natural, in the practical 
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Figure 15.2 The tangent as the extension of an infinitesimal side. 


sense that he imagined arbitrarily small or large quantities, and in the theoretical sense 
that they can be manipulated symbolically. 

Their logical status was less clear, and disputed. The philosopher George Berkeley, 
ridiculed infinitesimals, claiming that it was beyond common sense to imagine quanti- 
ties smaller than any sensible quantity, and even more nonsensical to imagine quantities 
that are smaller still. He referred to infinitesimals sarcastically as ‘ghosts of departed 
quantities’, neatly confusing the two different interpretations. For generations, mathe- 
maticians disputed the precise meaning of this elusive concept, seeking to circumvent 
the logical difficulties. 

Cauchy succeeded, but his ideas were widely misunderstood, even today. He for- 
malised the ‘variable quantity’ approach, defining an infinitesimal as a sequence a = 
(ay) that tends to zero. He could then calculate f(x+«) for infinitesimal a as the sequence 
(f(x+ a,)). He also formulated the notion of continuity for a function in such terms: f(x) 
is continuous if, for any infinitesimal a, the difference f(x+a)—f(x) = (f(«+an)—f (x)) 
is also infinitesimal. As he grew confident using infinitesimals, Cauchy manipulated 
them in the same way as he manipulated variable quantities. He imagined points and 
lines after the style of the ancient Greeks, where a line is an entity that has ‘length but 
no breadth’ and a point is an entity that has ‘position but no size’. A line is an entity in 
its own right. A point can be marked on a line and a line can be drawn through a point. 
Furthermore, points can be constant, remaining in a fixed position, or variable, including 
infinitesimals as variables that tend to zero. 

What is less obvious is the nature of the number line itself. Not only is it possible 
to mark rational points on it, but there are also points such as /2 or z that arise in 
geometric constructions but are not rational. At the end of the nineteenth century, a 
solution was found by formulating the completeness property of the number line, so 
that it includes not only the rational numbers, but also the limits of convergent rational 
sequences. This notion of the reals cannot include infinitesimals if they are defined as 
“smaller than any positive real number’, because this concept is logically contradictory. 

A significant change in interpretation then occurred. Instead of a line being something 
drawn physically by the stroke of a pen, or imagined theoretically as a Platonic entity 
with length but no breadth, it was re-conceptualised as the set of all real numbers — 
defined, either as limits of convergent (that is, Cauchy) sequences of rational numbers, 
or as Dedekind cuts on the rationals [21]. Now every real number occupies a fixed place 
on the number line. It cannot move around, or become ‘arbitrarily small’, so the formal 
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real number line cannot include infinitesimals in Cauchy’s sense. Indeed, the notion of a 
‘variable’ was reformulated as a fixed but arbitrary member of some set. Movement was 
effectively banished from the fundamentals of mathematics. In consequence, the notion 
of an infinitesimal was excluded from formal set-theoretic analysis. 

Yet the idea that quantities can vary, and become arbitrarily small, remains in the 
mental imagery of practising mathematicians, especially in applied mathematics. The 
recent arrival of dynamic computer graphics offers an entirely new practical way to 
visualise mathematical ideas. Greek geometers drew a curve as a mark in the sand; later 
generations drew it in ink on paper, or printed it in a book. Today we can represent a 
curve on a high-resolution visual display, and zoom in on part of the curve to magnify 
the image. Unlike the magnification of a picture in a textbook, where the thickness of 
the graph increases under a magnifying glass, we can now move our fingers apart on 
a tablet computer to magnify the picture while the graph is redrawn at the same visual 
thickness. 

As we zoom in on a standard smooth graph such as y = x,y = e*, or y = sinx, 
the magnified picture looks less and less curved. At successively higher magnifications, 
a small portion of the graph looks more and more like a straight line, and the graph 
and the tangent become visually indistinguishable, see Figure 15.3. Now, under high 
magnification, a picture of a smooth graph magnifies locally to look like a straight line. 

This suggests a new way of imagining the tangent to a differentiable function: under 
high magnification, the graph looks less and less curved, until at suitably high magni- 
fication a small portion of the graph looks straight. The derivative is now visually the 
slope of the graph, and as we cast an eye along the graph, the derivative is the changing 
slope of the graph itself. This property can be visualised for combinations of standard 
polynomial, trigonometric, logarithmic, and exponential functions, but it needs to be 
formally defined to reach modern standards of rigour, and suitably interpreted to apply 
to complex analysis and higher-dimensional vector calculus. 


The Relationship Between Real and Complex Analysis 


The generalisation from the real to the complex case follows the same symbolic 
formulation, but the visual representations are different. In the real case, we can draw 


| 


Figure 15.3 Successive magnifications of a smooth curve. 
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trace images 
A of trace 
Zz 
point here f Kia 
tae Prrrnnrnnr——> w=f(z) 
SS z-plane w-plane 
— 
w-plane 


z-plane 


Figure 15.4 Yop: Tracing a point in the z-plane to see its image move in the w-plane. Bottom: 
Visualising this in three dimensions by stacking planes. 


the graph of a function f : D > Rasa subset {(x, f(x)) € R? : x € D}, but this cannot be 
directly generalised to the complex case because the graph is a subset of C x C, which 
requires four real dimensions. However, we can use alternative representations in two 
or three dimensions. For example, Figure 15.4 (top) represents the complex function 
w = f(z) by drawing parts of the z-plane and the w-plane side by side. If a finger starting 
at a point z traces a path in the z-plane, then the corresponding point w = f(z) moves 
around in the w-plane. 

This function can be visualised in three-dimensional space by placing the w-plane 
above the z-plane, with an arrow connecting z to f(z). This representation is similar to 
that used in Figure 2.9, but with an extra dimension in the z-plane. 

Then, as z moves in the z-plane along a path y, the arrow moves along the image path 
Jy, tracing out the motion of w = f(z), Figure 15.4 (bottom). 

The movement of the finger travelling smoothly along a curve in the z-plane without 
stopping can be represented formally by a path y : [a,b] — C, where y is differentiable 
and y’(t) € O fort € [a,b]. Recall from Definition 6.21 that we call such a path a regular 
curve. In this case, the tangent to y at z9 = y(t) makes an angle arc (y‘(to)) with the 
real axis. In Section 13.3, for an analytic function f, we distinguished between regular 
points zo where f’(zo) # O and critical points where f’(zo) = 0. The behaviour of the 
transformed path fy near wo = f(zo) depends on this distinction. The transformed image 
path w = f(y(a) satisfies (f 7)’ (to) = f’(zo)y‘ (to), and if zo is a regular point of f, that is 
f' (zo) # O, then the image path has a tangent at w = f(zo) as in Figure 15.5. The tangent 
makes an angle 


arc ( f'(zo)y'(to)) = are (f’(zo)) + are (y’(to)) 
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Figure 15.5 Mapping a path y in D by an analytic function f where f’(zo) 4 0. 


Figure 15.6 Tangent to curve in the w-plane. 


with the real axis. The transformation f turns the tangent at a regular point through an 
angle arc(y’(to)). 


15.2.1 Critical Points 


In Section 13.3 we compared the geometry of a differentiable complex function f near 
a regular point and near a critical point. For a regular point zo, the tangent to a regular 
curve y through z has a corresponding tangent through f (zo). 


Example 15.1. Consider the function f(z) = sin z at the origin where f’(0) = cosO = 1. 
The path y(f) = t+ it for t € [—1, 1] in the z-plane is transformed into 
f(v@) = sin(t + if) = sin( cos(it) + cos(f) sin(it) 
= sin(t) cosh(t) + cos(f) sinh(f) 


in the w-plane, see Figure 15.6. 


At a critical point, f’(zo) = 0, so t collapses to a single point: t(z + h) = wo. In 
Chapter 13 we noted that f need not be conformal at a critical point. In this case, a graph 
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plane -plane 


Figure 15.7 Transforming the path y(t) = te! by the function w = z?. 


y in the z-plane passing through zo is transformed by an analytic function f in a subtler 
manner. 


Example 15.2. Consider the function f(z) = 2” at the origin where f’(0) = 0. The path 
y(t) = te’? fort € [—r,r] in the z-plane is transformed by f into the composition 


fyvO= a cos(20) + it? sin(20 ) 


in the w-plane. (Recall we now write fy instead of f oy, for brevity.) The original path y 
in the z-plane starts at —re' and moves in a straight line through the origin to the point 
re on the other side; the path fy in the w-plane lies on the straight line at an angle 20 
to the real axis, starting at (—r)7e79 = re? moving through the origin and turning 
back in the opposite direction to r7e”"’, see Figure 15.7. 

This apparently sudden turn in direction turns out to be a direct generalisation of 
the real case. If the real function f(x) = x* is drawn with separate x- and y-axes, as x 
increases from —r to +r through the origin, the y-value x* starts at 77, moves down to 
the origin, and then turns back up again to the starting point. 

An alternative way to grasp what is happening is to represent the path y : [a, b] > 
C by writing fy(t) = u(t) + it) and looking at the graph of (¢, u(t), v(t)) in three- 
dimensional (t,u,v)-space. In Figure 15.8 the w-plane is drawn horizontally and the 
t-axis vertically upwards. As t moves up the vertical axis from —r to 1, (¢, u(t), v(t)) 
moves along a path in three dimensions, with the tangent at the origin being vertical 
where its projection onto the w-plane is a single point. Looking down on the w-plane, 
the point (u(t), v(t)) moves from r?e!? to the origin and turns back again. 

Imagining ¢ as time, the velocity of the moving point on the pathf is (fy)/(t) = 2te”?, 
and as ¢t passes through the origin the velocity smoothly slows down to zero, and then 
reverses without any sudden change in speed. 


More generally, if a point moves along any smooth path y through a critical point in 
the z-plane, then the path fy in the w-plane may turn back in the opposite direction. In 
three-dimensional (¢, u, v)-space, however, it travels smoothly through the critical point. 
For example, for w = z”, the path y(f) = 1—e! (¢ € [—1, 1]) is acircle radius 1, centre 
1, which travels through the critical point at the origin in the z-plane. Meanwhile the path 
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Figure 15.8 The pathfy in (¢, u, v)-space. 


Figure 15.9 Magnifying a path in the w-plane through a critical point of f. 


in the w-plane passes through the origin as in Figure 15.9. Magnifying the picture in the 
w-plane near the origin reveals the path moving in and out in what looks like a straight 
line. 


Interpreting Power Series Tending to Zero as Infinitesimals 


Using the idea that an infinitesimal quantity is a variable that tends to zero, the variable 
part of a power series can be interpreted as an infinitesimal. Moreover, it has a special 
property that relates to Leibniz’s idea that infinitesimals have an order of size. Recall 
from Definition 13.10 that an analytic function f : D —> C is of order n at z € D 
(where n > 1) if f™(zo) A 0, and f (zy) = 0 for 1 < k < n. By (13.6) the Taylor series 
about zo then has the form 


f(zo +h) =f (zo) HAF (zo) /nt ++ 


which is a constant f(zo) plus an infinitesimal of order n. So the behaviour of f(z) near a 
critical point zo is given by the order of the variable part of the analytic function at the 
point. The definition applies to regular points where n = 1, and to critical points where 
n > I, to give a single theory in all cases. 
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Using polar coordinates, we observed that if h = re’, where h is small, then f trans- 
forms a small region of the z-plane near Zo to the w-plane near wo = f (zo). The radius 
r scales to kr”, and the original angle @ is multiplied by n and rotates to n@ + a, as we 
saw in Chapter 13, Figure 13.6. Moreover, we noted that a further rotation of the line 
from Zp to Z) + h in the z-plane through @ rotates the corresponding line in the w-plane 
to n(0 + @)+ a, as in Figure 13.7. 

The case of a straight line passing through zp can now be considered as two half- 
lines where ¢ = z. The image in the w-plane (for suitably small /) is then made up 
of two half-lines, one at an angle 6 to the real axis, the other at an angle 0 + 2. Under 
the transformation f, the angle between them is increased to nz. For n odd, this gives 
two half-lines in opposite directions and, for n even, it gives two half-lines in the same 
direction. 

This is a natural generalisation of the real case where, for n odd, as x increases through 
zero, so does y, while for n even, the graph of y = x” has a minimum at x = 0, and as x 
passes through the origin, y travels down to zero and turns back up again. 

The language used in the preceding discussion has been expressed in terms of a 
variable h that is ‘sufficiently small’, linking to the Leibniz notion of infinitesimals of 
various orders of size and the dynamic idea of ‘arbitrarily small’ quantities in applied 
mathematics. We now introduce a formal set-theoretic definition of infinitesimals and 
prove a structure theorem that reveals how they can be visualised on a number line or in 
a complex plane. 


Real Infinitesimals as Variable Points on a Number Line 


The structure of the human visual system makes it natural for us to imagine points 
moving along a line. Given a vertical line in the real plane of the form x = v, consider 
where various polynomials intersect this vertical line as it is moved to the left or right, 
Figure 15.10. A constant function y = k always meets the line at the same height k, but 
the line y = x meets the line at a height v, and as the line moves, v varies. As v gets 
smaller, the variable point v moves below any fixed positive constant k. In this sense, 
as v tends to zero, the variable point becomes less than k for any real number k — one 


“ 


Figure 15.10 Infinitesimals as variable points on a line. 
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interpretation of ‘infinitesimal’. In the same sense, the variable point 7 is smaller still — 
infinitesimal compared to v. 

The vertical line x = v can now be seen to have two kinds of points. Some are 
familiar constant points that remain in the same position, while others are variable, and 
can become arbitrarily small. Others can become even smaller. 


Infinitesimals as Elements of an Ordered Field 


The plan now is to extend the real numbers R to a larger ordered field K that contains 
infinitesimals, and to extend real functions f so that we can calculate f(x) for x € K. 
Taking a cue from the previous section, we might begin with the field R(v) of rational 
functions in v, 

ay + av t+++++dnv" 

bo + biv+-++++bmv" 
where dp, @1,.--;Gn,b9,b1,...,bm € R and b,, 4 0, and give it the structure of an 
ordered field by defining v to satisfy 0 < v < k for all positive k € R. This lets us 
evaluate rational functions that include infinitesimals. 

For instance, if f(x) = x7 /(x — 1) we can extend f to include calculations with the 

infinitesimal v by substitution: 


flxtv)=(4+v?/a+v—l eR) 


To extend real functions f : D — R so that we can calculate f(x + h) when x € D 
and / is infinitesimal, we work in a suitable larger field K. The field K = R() is fine for 
rational functions, but not for more general functions. Leibniz’s version of calculus deals 
with polynomials, trigonometric functions, exponential functions, and, more generally, 
any function that can be expressed as a power series. In this case we need a larger field 
that includes power series in an infinitesimal. If we include a single infinitesimal ¢, then 
we also need all the elements that are produced by performing the usual operations of 
addition, subtraction, multiplication, and division with power series so that the system is 
an ordered field. In particular, the field should include all power series in ¢ and inverses 
of infinitesimals such as 1 /e. 


Foundational Example 15.3. (The Superreal Field Generated by a Single Infinitesimal). 
The superreal field IR, is defined to consist of all series of the form 


CO 
r 
) Ar € 
r>=n 


where n is an integer (which may be negative). To obtain a field, the series are added, 
subtracted and multiplied term by term. Division in the form 


(ee) / (Eee) Eo 
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is achieved by solving the equation 


[o,@) CO CO 
r r r 
) Cré ) bre’ } = ) ar€ 
rmm-+n r=n r=m 


recursively for successive coefficients Cn+n, Cm+n+1,-- + 

Here we do not need to worry about convergence, because all the operations are purely 
formal. It is possible to give an alternative set-theoretic definition in which the superreals 
are defined to be functions s : Z — R, where each s has an integer n such that s(r) = 0 
for r <n. The function s : Z — R then corresponds to the infinite series )°~-, are”. 
Knowing that the superreal numbers can be given a formal set-theoretic definition, we 


can safely think of them as infinite series in e. 


DEFINITION 15.4. An element )0°°, a;é" is an infinitesimal of order n if an # 0 and 
n > 1. Itis finite if n > 0 and infinite of order k ifn < 0,a, 4 Oandk = —n. 


Examples 15.5. (i) ©? + 17e7° isa third order infinitesimal. 
(ii) 0,25,7 +e, e°% are finite. 
(iii) 1/e and e~!! — 15¢ are infinite. 


(iv) sine =e-—& /3! +--+ is a first order infinitesimal. 


The term ‘order’ is used here in a special sense referring to the ‘order of size’ of an 
infinitesimal as distinct from the order a < b in the field. There may be two elements 
a < bwhere the order of a as an infinitesimal is greater than the order of b. An example 
is a = —e? and b = «. If there is a danger of ambiguity, we refer to the order of a single 
element a in the sense of Definition 15.4 as the order of infinitesimality of a, and the 
order a < b between two elements as the linear order between a, b. 

The order of infinitesimality of a = }°°° _, a,e” is the subscript n of the dominant term 
a,, which is the first non-zero term in the sum. 

The linear order a < b also depends on the dominant terms of a,b. If a = 

a are’,b = pee be", then without loss of generality we can assume m < n. 
If m = n then the linear order of a, b is the same as the linear order of am, bn. Ifm <n, 
then a,, dominates and the linear order is given by the sign of a,,. (In the latter case, 
bm = 0, so the same rule applies: in this case the linear order of a, bis the same as the 
linear order of am, 0.) 


Examples 15.6. (i) «2 < 3e. 
(i) —I1/e < Oare finite. 
(iii) ¢€ <r for every positive r € R. 
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The superreal field generated by just one infinitesimal has many of the properties that 
Leibniz wanted. Every non-zero element is either finite or has a specific order as an 
infinitesimal, or as an infinite element. Any real function expressible as a power series 
f(x) = OS, arx’ can be extended to define f(x + 5) for an infinitesimal 6 = )°&,,, bse* 
by substituting 


fa%t+ts)= \ ae [+ Ye 


and then rearranging the result as )~~,, c-x” where cy € Re. This process looks messy, 
but it can easily be done term by term. (See the exercises at the end of the chapter.) 

Our choice of R, is made initially for pedagogical purposes; in particular, to investi- 
gate intuitive models for limits and derivatives in terms of genuine infinitesimals. This 
field has an explicit description, but it does not have all the properties we would like. For 
example, the exponential function is not defined at 1/e. Indeed, exp(1 /e) has a series 
involving all negative powers of €, not just a finite number. This feature causes no prob- 
lems in the sequel, but it is a limitation of the current approach to infinitesimals, and it 
can be avoided altogether by working with the hyperreal numbers, Section 15.10. 

The same technique works in the complex case, where C, is the supercomplex field of 
elements )~~,, cre” where the coefficients c, are complex numbers. Complex functions 
F(Z) given by power series can be extended to apply tof(z+6), where 6 is infinitesimal. 
However, complex analysis is much simpler than real analysis because every differ- 
entiable complex function can be represented locally by a power series. Introducing 
infinitesimals into real analysis therefore requires a more powerful extension of the real 
numbers. 

We begin with a simple definition that proves amazingly effective. 


DEFINITION 15.7. An ordered field K that contains the real numbers as a proper 
ordered subfield is a super ordered field. 


Examples of super ordered fields include the field of rational functions, the superreal 
numbers IR,, and many others, such as the hyperreal numbers that are studied in non- 
standard analysis, Section 15.9. 

A super ordered field has all the familiar properties of an ordered field. It also contains 
infinite and infinitesimal elements defined as follows: 


DEFINITION 15.8. In any super ordered field K, an element x € K is: 


positive infinite if x > rforallre R 
negative infinite if x <r forallre R 
finite ifa <x < bforsomea,be R 
positive infinitesimal if 0 < x <r for all positive re R 


negative infinitesimal if 0 < —x < r for all positive r € R. 


Using the standard properties of an ordered field it is straightforward to prove that an 
element x is infinitesimal if and only if 1/x is (positive or negative) infinite. (These and 
other properties of arithmetic and order are left as exercises at the end of the chapter.) 
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Structure Theorem for any Ordered Extension Field of R 


The key to visualising the infinitesimal structure in the field R; is a simple but powerful 
structure theorem: 


THEOREM 15.9. Every element x in a super ordered field K is either infinite, or 
uniquely of the form x = c + h where c € R and his zero or infinitesimal. 


Proof. We need consider only the case where x is finite, so that a < x < b, where 
a,b € R. Let S = {r € R: r < x}. Then S is non-empty because a € S, and bounded 
above (by b). By completeness of IR, the set S has a unique least upper bound c. Let 
h =x—c,sox =c+h. Now we use completeness to show that if h 4 0, then h is 
infinitesimal. There are two cases: 

For h > 0, suppose that there exists k € R such thatO < k < h. Thenc+k < 
c+h= x, so by definition of S,c+k € S. But this implies that c +k is greater than the 
least upper bound c. By contradiction, h is infinitesimal. 

For h < 0, suppose there exists k € R such thatO <k < —h. Thenx=c+h< 
c—k <c, so by definition c — k is an upper bound of S. But c — k is less than the least 
upper bound c, again giving a contradiction. So h is infinitesimal. 

To prove uniqueness, suppose that c +h = d+ k, where c,d € R and h,k are 
infinitesimal. Then c — d = k — his infinitesimal or zero, and real. Hence it is zero. We 
have now proved that any finite x € K is uniquely of the form x = c+h, wherec € R 


and /ris infinitesimal or zero. 


We extract a key concept: 


DEFINITION 15.10. The real number c is called the standard part of x and is written 
c=st(x). 


This structure theorem is the key to imagining infinitesimal quantities on a number 
line. Over the centuries, we have expanded our interpretation of a number line, from 
the Greek idea of a line where the distance between two points is the same regardless 
of direction, to a number line with positive and negative numbers, rational numbers, 
irrational numbers, and real numbers. There is no logical reason to stop there. A super 
ordered field, has finite, infinite, and infinitesimal elements. The structure theorem tells 
us that any finite element is (uniquely) a real number plus an infinitesimal. An infinites- 
imal is, of course, too tiny to see to a normal scale, but this is not as problematic as it 
seems to be at first sight (pun intended). For the real numbers, on a normal scale we can- 
not see the difference between the irrational number z and the rational number given by 
z to a thousand decimal places. However, if we magnified a small part of the picture by 
a factor 101°, the difference would become apparent. This simple idea offers the way 
ahead: we can magnify a super ordered field to distinguish infinitesimally close points 
visually. First, we establish some simple ideas relating finite elements and infinitesimals. 


PROPOSITION 15.11. Let F be the subset of finite elements in K and I the subset of 
elements that are zero or infinitesimal. Then a,b € F,c,d € I imply a+b,a—b,abe€ 
F,ce+d,c—d,acel. 
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Proof. These follow from the definitions. 


Proposition 15.11 states that the sum, difference, and product of finite numbers are 
also finite, but the quotient a/b need not be finite (if b is infinitesimal). The sum and 
difference of infinitesimal elements are infinitesimal, and the product of a non-zero finite 
element by an infinitesimal is again infinitesimal. (Anyone familiar with ring theory will 
recognise that F is a ring and J is an ideal of F; however, this is not essential to follow 
the ideas in this chapter.) 


PROPOSITION 15.12. For x,y € F, the standard part map st: F > R satisfies 
st(x + y) = st(x) + st(y) 
st(x — y) = st(x) — st(y) 
st (xy) = st (x) st (y) 
st (x/y) = st(x)/st(y) (for st(y) # 0) 


Proof. Use algebraic manipulation of the definition, taking care with division. 


Visualising Infinitesimals as Points on a Number Line 


The structure theorem offers new ways of visualising infinitesimal quantities as points 
on a number line. A standard picture cannot distinguish points that differ by an infinites- 
imal, because the difference is too small to see, or represent infinite quantities, because 
they are too far away. But a linear map m(x) = ax + b for appropriate values of a,b € K 
reveals infinitesimal or infinite detail: 


DEFINITION 15.13. The map m : K — K where m(x) = (x —c)/d forc,d 4 Ois 
the d-lens pointed at c. If d is infinitesimal, then m is a microscope pointed at d. If cis 


infinite then m is a telescope pointed at infinity. 
The set 


V = {x € K : m(x) 1s finite} 


is the field of view of m. 


The field of view V of a d-lens contains those elements x € K whose image m(x) is 
finite, and these images can be marked on a finite line in a picture. For instance, if c 
is a real number and 6 is infinitesimal, the field of view V of a 5-microscope pointed 
at c contains c,c + 6,c + 26. These differ by infinitesimal quantities, but they map to 
mc) = 0,m(c+ 6) = 1,m(c + 26) = 2, which are now visibly different. At the same 
time, the points c+6 andc+5+3652 map to m(c+6) = 1,m(c+6+362) = 1+36, which 
differ by an infinitesimal and cannot be distinguished at this scale. See Figure 15.11. 


When we make a map in a real-world situation, say a map of England including 
London, we mark the location of London using the same name. This technique can 
be used to advantage when magnifying infinitesimal detail using a 6-lens. Renaming the 
image points with their original names, as in Figure 15.12, the picture may be interpreted 
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Figure 15.12 Scaling up infinitesimal detail. 


as scaling up part of the number line K to distinguish points that differ by an infinitesimal 
amount. 

Using a specific map, it is possible to see only a certain level of detail, because 
some differences in the image are too small to distinguish, and some are too far away. 
Leibniz imagined infinitesimals of order 1,2,3,... and in the above pictures we have 
included only examples of polynomials in an infinitesimal that follow Leibniz’s intu- 
ition. However, we must be prepared for more general situations that occur in practice. 
For example, if h is an infinitesimal of order 1 and h has a square root r € K , then 
r> = h andr must have order 5. To deal with the general case, instead of referring to 
the specific order of infinitesimality as a whole number, we compare the relative order 
of size of two elements a, b as follows: 


DEFINITION 15.14. Ifb € K andb# 0, an element a € K is: 


of higher order than b if a/b is infinitesimal, 
of the same order as b if a/b is finite, 
of lower order than b if a/b is infinite. 


When visualising infinitesimal detail using a 6-lens pointed at c, the field of view 
V consists of elements that differ from c by an element of the same or higher order. 
Those differing by the same order map onto distinct points: those that differ by a 
higher order quantity are indistinguishable to the naked eye. We can model this visually 
using: 


DEFINITION 15.15. The optical 5-lens or optical microscope ts pointed at c is pw : 
V — R where p(x) = st (% — c)/8). 
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Figure 15.13 Scaling points in the field of view onto the whole real line. 


The image is now a drawing of the real line. Points in V that differ by an element of 
the same order as 6 map on to different real numbers and points that differ by a higher 
order than 5 map onto the same real number. Meanwhile, the 6-lens m(x) = (x —c)/6 
maps points outside the field of view V onto infinite elements of K, which are beyond 
the range of a finite drawing. In Figure 15.13 the line representing the field K is greyed 
out to show that only the point c and the field of view V are magnified to give a visible 
point on the real line. 

In general, the optical microscope 4 : V > R always maps the field of view V onto 
the whole of IR because pu(c + x6) = x for any x € R. 

A special case occurs when we take c = 0 and 6 = 1. Now the field of view is the set 
of finite numbers F’, and the optical 6-lens is the standard part function st. The image is 
the whole real line and the points mapping onto a particular real number x € R consist 
precisely of x and those points differing from x by an infinitesimal. This suggests: 


DEFINITION 15.16. The monad M, around x € K is 


M,={x+heK:he} 


where J is the subset of K consisting of zero and infinitesimals. 


Some sources use the alternative name ‘halo’ in place of ‘monad’. 

In this definition, x may be any element of K, so any infinite value of x is also sur- 
rounded by its own monad. If x is finite, then by the structure theorem of Theorem 15.9, 
there is only one real number c € M, and the standard part map st : F — R collapses 
the monad M, down onto the real number c. 


DEFINITION 15.17. The relationship x ~ y is defined on K by x © y if and only if 
x—yel. 


This is an equivalence relation and the monads are the equivalence classes. It lets us 
imagine the picture of the super ordered field K as points on a line, where the finite 
points F consist of the real numbers and each real number c is surrounded by the monad 
M, of elements c + h where h is infinitesimal. 

This vision lets us see that any super ordered field K is not complete. We need: 


THEOREM 15.18. Fora € R, the monad Mg is bounded above by any real number 
b > a, but has no least upper bound. 


Proof. If 1 € K is a least upper bound of M,, then either ] € M, or! € M;, for b > a. 
In the first case, for any positive infinitesimal h, 1+ h € Mz is greater than the supposed 
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upper bound / of M,. In the second, / — h € M,j is an upper bound for M, that is smaller 
than the least upper bound /. By contradiction, M, has no least upper bound. 


COROLLARY 15.19. A super ordered field is not complete. 


This should not be a surprise. A complete ordered field is unique up to isomorphism 
and does not contain infinitesimals. So a super ordered field that contains infinitesimals 
cannot be complete. The structure theorem, Theorem 15.9, tells us why. 

Nevertheless, we now have a fundamentally new way of visualising ‘the number line’. 
Not only does it include the familiar real numbers; it can include infinitesimals and 
infinite quantities that can be seen by human eyes only through appropriate d-lenses. 


Complex Infinitesimals 


The generalisation of infinitesimals in any super real field K can be extended to the 
complex case by using quantities of the form x + iy where x, y € K and i* = —1. 


DEFINITION 15.20. Given a super ordered field K, the corresponding super complex 
field is the field K[i] of polynomials in i where it = —1. It is an extension field of the 
field of complex numbers C = R{[i]. It is a field because the polynomial 1 + 1 has 
no zeros in any ordered field, so is irreducible. Alternatively, a direct proof follows the 
usual one for C by writing (x + iy)! in the form u + iv. 

An element z = x+iy where x, y € K is infinitesimal if both x and y are infinitesimal, 
finite if both x and y are finite, and infinite if at least one of x, y is infinite. The standard 


part of a finite element z = x + iy is 
st (z) = st (x) +1st(y) 


An element w is of higher order than a non-zero element z if w/z is infinitesimal; of the 
same order if w/z is finite and non-zero; and of lower order if w/z is infinite. 

Let Fj be the set of finite elements in K[i] and J the set of elements that are infinites- 
imal or zero. (Again Fj is a ring and J; is an ideal in F with the properties specified in 


Proposition 15.11.) 


THEOREM 15.21 (Structure Theorem for a Super Complex Field). Every element z = 
x + 1y in a super complex field is either infinite or uniquely of the form z = w + h where 
w € Cis given by w = st(z) and h is zero or infinitesimal. 
For finite u, v, the standard part map satisfies 
st(u+ v) = st(@) + st(v) 
st(u — v) = st(u) — st(v) 
st (uv) = st (u) st (v) 
st(u/v) = st(u)/st(v) (for st(v) #0) 

st(x + iy) = st(x) +ist(y) 
Proof. If zis finite, then both x and y are finite elements in K, and the structure theorem 
for super ordered fields shows that they are uniquely of the form x =c+p,y=d+q 
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where c,d € Randp,g €I,soz=w+hwherew=c+id €C,andh =p+igé i. 
The properties of the map st follow by direct calculation. 


Many definitions and properties generalise from the real case, such as: 


DEFINITION 15.22. The monad M,; around z € K[i] is 


{ct+he Ki]: heh} 


The monads partition K[i] into non-overlapping subsets consisting of points that differ 
by an infinitesimal. This includes the case where z may be infinite, so that even an infinite 
element is surrounded by a monad of points differing from it by an infinitesimal. Finite 
elements in K[i] consist of the underlying field of complex numbers together with the 
monad M, surrounding each complex number z. 


DEFINITION 15.23. If d 4 0, the d-lens pointed at win a super complex field is 
Z—wW 
d 


The field of view of mis the set of points z in the super complex field such that (z— w)/d 
is finite. 


m(Z) = 


The optical d-lens pointed at w is 


y(z) = st(m(z) = st (=) 


If d is infinitesimal, the d-lens is called a microscope, and if w is infinite, it is called a 


telescope. 


Multiplying by a complex number in the form re!’ scales by a factor r and turns 
through an angle 9, so it is sensible to take 6 = 0 and to select d to be of the form 
d = 6 +10 where 6 > 0. (Such an element will be called a positive infinitesimal.) Then 
the 6-lens simply scales by a factor 6 and does not rotate the image. For this reason, we 
will always assume that the scaling factor 6 is a positive element of the subfield K. 


Example 15.24. Figure 15.14 shows the point zy = re’, where r,@ € R, lying on the 

circle |z| = rin K[i], zo +h = re*+® lies on the circle where ¢ is an infinitesimal in K. 

Let «z be an optical microscope ju(z) = st (z — zo)/d, where 5 is the same order as €. 
Then 


h=re®t — re? 
= re'(el® — 1) 
= zo(ie — 67/2 +---) 
= zoié + higher order terms 
So (Zo) + h = (Zo + Zole), and the point z) + on the circle is mapped optically to 
the same point as zo + zoié. 
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Figure 15.14 An infinitesimal part of a circle under optical magnification. 


Multiplying by ie rotates the plane through a right angle (multiplying by i) and scales 
by the real factor ¢, so Zz) + Zoi€é is on the tangent at right angles to the radius from 0 
to z. Furthermore, for any A € R, taking € = 46 maps zo + zol€é onto a general point 
zo +zoiA on the tangent, so that the optical magnification transforms the part of the unit 
circle in the field of view of jz onto the whole tangent line. 

In this very precise sense, an infinitesimal portion of the graph of a circle is ‘locally 
straight’ and is indistinguishable from an infinitesimal part of the tangent line when 
viewed through an optical microscope. 


Non-standard Analysis and Hyperreals 


In the early days of calculus, functions were given by formulas. When the formula is 
a polynomial or a power series — which includes functions such as exponentials and 
trigonometric functions — the value of f(x + h) for infinitesimal h can be calculated 
in the superreal numbers R, in the real case, or the supercomplex numbers C, in the 
complex case. The derivative may then be calculated as 


f(x +h) —f@) 
h 


f'(® =st ( ) for infinitesimal h 

The fields R, and C, are sufficient for differentiation. By the Fundamental Theorem 
of Calculus, they are also sufficient for integration, using an antiderivative. However, 
there is no mechanism in these fields to deal with more general functions such as 
sequences s : N — R, and they do not permit a suitable definition of the Riemann 
integral. When the real numbers were formulated set-theoretically at the end of the 
nineteenth century, infinitesimals had no place in formal epsilon-delta analysis. 

In 1966, Abraham Robinson [17] made the formal use of infinitesimals rigorous using 
mathematical logic. He introduced an extension field *R of R, the hyperreal numbers, 
which includes infinitesimals and provides natural extensions of all of the basic func- 
tions of analysis. It is based on a distinction between two different types of logical 
statement, as follows. 

The axioms for arithmetic and order in R quantify properties of elements of IR. These 
include 


forallx,ye R:x+y=y+x 
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or 


there exists 0 € R such that: for allx ¢ R,x +0 =x 


or 


for all x,y,z¢ R:x > y,y > zimpliesx > z 
Just one axiom — the axiom of completeness — quantifies properties of sets: 


for all sets S C R, if S is non-empty and bounded above, 


then it has a least upper bound. 


Corollary 15.19 proves that super ordered fields are not complete, because monads 
are bounded above but fail to have least upper bounds. This is a subtle clue. When 
Robinson introduced the hyperreal numbers, he observed that properties that quantify 
elements (first order logic) generalise to *IR, but properties quantifying sets (second 
order logic) may not. Writing properties using the universal quantifier V (meaning ‘for 
all’) and J (‘there exists’), most properties of arithmetic quantify elements, such as 


VxyEe Rixty=yt+x 


or 


AOE R:Vxe R:x+0=x 


These use first order logic, but the completeness axiom — which quantifies sets — does 
not. 

The solution is to find a system of hyperreal numbers *IR for which any first order 
logical statement that is true for the real numbers R remains true in the extended system 
*R. For example 


Vx,y,E"*Rixty=yt+x 
or 
40 e*R: Vx e*R:x+0=x. 


remain true. 
However, second order logical statements, such as the completeness axiom, need not 
extend to *R. 


DEFINITION 15.25. The system of hyperreal numbers *R is a proper ordered field 
extension of R where every subset D C R has an extension D C *D C *R and every 
function f : D > R has an extension f : *D — R that satisfies: 

Transfer Princple: Every true statement about the real numbers R expressed in 


first order logic is true in the extended system *R. 


(It is convenient use the same symbol for the extended function as for f to avoid 
proliferation of *s. This should not cause confusion.) We talk of ‘the’ hyperreals, but the 
construction does not lead to a unique result; however, any of them does the same job, 
so we assume that a particular one has been selected. 
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There are two possible approaches to the hyperreal numbers. The first is to assume 
that such a system exists and then build up a theory based on the definition. The second 
is to provide a formal existence proof, or better, a construction, of a hyperreal number 
system. 

The first approach, though it does not prove the actual existence of hyperreal numbers, 
has great potential. For example, assuming its existence, it is possible to prove that the 
extension *N of the natural numbers N must have infinite elements. The proof is simple. 
The first order statement 


Vx Ee Rane Nin>x 


states that for every real number x, there is always a bigger natural number n. Its 
extension says that 


Vxe*Rine*N:in>x 


and, since we know that *R contains infinite elements, so does *N. 

This immediately gives a new way of thinking about the limit of a sequence as a 
function s : N — R that extends tos : *N — *R. Consider the elements s(N) for 
infinite N and take the standard part st (s(V)). If this is the same for all infinite NV, then 
this common value is the limit of the sequence. 

For instance, if 


then the limit is 
6N? +N 6+1/N 6+0 
sts(V) = st | —,—— } = st = -—2 
3N2 + 1 3+1/N* 3+0 


Limits still have to be calculated, but the definitions become simpler. For instance, the 


usual definition of the limit s of a sequence s,, of real numbers is 
Ve>OINEN:n>N= |sn—5| <8 
Using the hyperreals, this becomes, as just explained: 
For all infiniten € *N: s = st(s,) 


In general, definitions using the hyperreals (called non-standard analysis) involve 
fewer quantifiers than the standard definition. For instance, pointwise continuity of 
f :D — Rhas a standard definition: 


Vx Ee DVe ER(e > 0) 55 € R(S > 0) Vx’ ED: |x-x'| <8 > |f(x) -—f’)| <e€ 
and uniform continuity of f : D — R has a standard definition: 


Vx € DVx' € DVe € R(e > 0) 35 ER(S> 0): |x -x'| <8 S If(X)—-f@)| <e 


Each has four quantifiers that are highly complicated for the human mind to hold and 
manipulate all at once. The corresponding non-standard definitions are, for pointwise 
continuity: 


Vx € DVx' €*D:x—X infinitesimal > f(x) — f(x’) infinitesimal 
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and for uniform continuity: 
Vx,x' €*D:x—x’' infinitesimal > f(x) — f(x’) infinitesimal 


which have only two quantifiers and are much simpler. 
Outline of the Construction of Hyperreal Numbers 


It is possible to use the idea of hyperreals based on the transfer principle, without proving 
that hyperreals exist. But to put the idea on a sound logical basis, it is important to show 
that such a system can be constructed from known mathematics. 

Broadly speaking, the construction of *R from R follows a similar overall strategy to 
the one that Cantor used to construct the real numbers R from the rational numbers Q. 
His construction begins with the set of all Cauchy sequences (a,) of rational numbers; 
where a sequence is Cauchy if for any ¢ > 0 there exists N € N such that m,n > N > 
|dm — 4| < €. Then he sets up an equivalence relation: 


(ay) ~ (bn) = (ay -_ by) —>0 


Then R is defined to be the set of equivalence classes of Cauchy sequences, and the 
operations of arithmetic and order are derived from operations on Cauchy sequences. 
The construction of the hyperreal numbers *IR from R follows a similar overall pattern. 
It begins with all sequences (7) of real numbers, sets up an appropriate (and cunningly 
defined) equivalence relation ~, and defines *R to be the set of equivalence classes. The 
field *IR is considered as an extension of R by identifying the real number r with the 
equivalence class [r,], where r, = r for all n. The detailed construction requires the 
notion of an ‘ultrafilter’, akin to the axiom of choice. For a more detailed discussion, see 
Robinson [17] or Stewart and Tall [21]. 

Once the equivalence relation is set up, extending real sets and functions to hyperreal 
sets and functions is exquisitely simple. The extension of any subset S C R to *S C*R 
is defined by: 


*S = {[rn] € *R: ry, € S for all n € N} 


The extension set *S is precisely the set of equivalence classes of sequences whose 
terms all lie in S. The extension of any function f : S > R tof : *S > *R is equally 
straightforward. Just define 


f(r) =  @n)] for all [rn] € *S 


If subsets of the real numbers or real functions are formulated set-theoretically using 
statements that quantify only elements, then those sets and functions can be extended to 
the hyperreal numbers using the same statements. For instance, if S = {x € R: x > 0} 
and f(x) = ./x, then*S = {x € *R: x > O}. Because every x € S satisfies x > 0 
and f(x)? = x, it follows that for every x € *S, the same equation holds, so f(x) is the 
(positive) square root of x. For a positive infinitesimal x, this means that ./x is also a 
positive infinitesimal whose square is x. 
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The theory also extends to more general real functions f : S > R”, where S C R”, to 
give an extension f : *$ — *R” that satisfies the transfer theorem for logical statements 
quantifying several elements in R. In particular, since every complex function w = 
f(z) can be expressed as a real function of two variables taking (x,y) to (u, v), where 
zZ = x-+iy and w = u-+iy, the hyperreal theory generalises naturally to the complex 
case. 


Hypercomplex Numbers 


To treat complex analysis in a similar manner we need the complex analogue of *R. 
The hypercomplex numbers *C are of the form a + ib, where a,b € *R and i? = —1. 
(As for hyperreals, the word ‘the’ assumes a particular choice for *R, hence for *C.) 
They generalise the real case so that any function f : S — C extends to a function 
f : *S — *C that satisfies the transfer principle: any logical statement quantifying 
elements in C remains true in *C. For example, suppose that a complex analytic function 
f : D — Con a domain D is expressed as a power series f(zq + 1) = pan, a,h" for 
he€ C, |h| < R. Then the extended function f : *D — *C is given by the same formula 
for h € *C, |h| < R. This makes the infinitesimal extension of complex analysis simpler 
than the real case. 

In particular, for any infinitesimal ¢ € C, the values of functions can be calculated 
using power series in €, so we need only work in the field C,. This, of course, is a 
subfield of the hypercomplex numbers *C, defined in the same way as R, but using 
complex coefficients in the power series in «. 

For any infinitesimal 6, the optical 5-lens jz pointing at zo € Cis defined by 


Z— £0 


Wz) = st 


Its field of view is the set of elements z) + h where h/6 is finite, so h = 5k where k is 
finite. This lies within the monad M,, of points lying an infinitesimal distance from Zo. 

The field of view need not be the whole of the monad. For instance, if we take 
any infinitesimal ¢ and choose 6 = e* then zo + € does not lie in the field of view 
of the optical 6-lens jz because ¢/5 = 1/e is infinite. Essentially the optical micro- 
scope lets us see detail differing from zo by the same order as 6, but lower order 
detail is too small and higher order detail is too far away to be seen in an optical 
image. 

Writing 1/5 in polar coordinates as 1/6 = re!’, the lens magnifies distances from zo 
by the (infinite) factor r and rotates through the angle 6. Choosing 4 to be any positive 
infinitesimal in *IR gives 8 = 0 so that the optical microscope jz magnifies the picture 
by a factor r without any rotation. For this reason, when using an optical microscope we 
take the magnification factor 6 to be a positive infinitesimal in *R. 


DEFINITION 15.26. A subset S € *C is called an infinitesimal 5-shape near zo € Cif 
it is of the form S = {z +h: z € C, his a 6-infinitesimal}. 
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Figure 15.15 Magnifying an infinitesimal 5-shape using an optical d-microscope. 


Example 15.27. For an infinitesimal 6 € *R, the subset 
{eo the *C:2€C,—-6 <h < 5} 


is an infinitesimal 6-shaped square in *C of side 26, centred on zo. Using the standard 
convention that the image j(z) is also denoted by the symbol z, this lets us see the 
infinitesimal shape in the complex plane C using the optical microscope jz, see Fig- 
ure 15.15. 


An analytic function f : D — C then extends naturally to f : *D — *C, and 
an infinitesimal shape near zo in *D transforms to an infinitesimal shape in *C near 
wo = f (Zo). 

In the z-plane, let z9 + h be in the field of view of the optical microscope p(z) = 
st ((z — zo)/5). Then 


Lzo + h) = st (h/S) 


Using an optical microscope v(w) = st((w — wo)/6d) in the w-plane with the same 
magnifying factor 6, 


v(f(Zo + h)) = st ((f(Zo +h) — f(Zo))/8) 
= st (((f(zo + A) — f(z0))/h)st (h/8) 
= f’(zo)u(zo + A) 


The complex number j(zZo + /) is transformed to its optical image v(f(zo + h)) 
multiplied by f’(zo). Writing f’(zo) in polar coordinates, f’(zo) = re, this scales the 
picture by a factor r = |f’(zo)| and turns it through an angle @ = arc (f’(zo)), see 
Figure 15.16. 

If f(a) #€ O, the transformation of the optical image is conformal, turning the 
infinitesimal shape precisely through the angle arg f’(zo) and scaling lengths by the 
factor |f’(zo)|. 

If f’(zo) = 0, the shape collapses to a single point. In this case, the detail in the 
w-plane involves infinitesimals of higher order. In Section 15.2 the Taylor series of f 
near zo is written in the form 
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Figure 15.16 Magnifying a transformed infinitesimal shape. 


f(zo +h) =f (zo) + hf (zo) /n! +++ 


where n > 1 and f(zo) ¢ 0. This is interpreted as the sum of the constant f(zo) plus 
the infinitesimal h"f“(zo)/n! + --- of order n in the variable h. 
Using an optical microscope of order 6” gives 


v(w) = st ((w — wo)/8”) 
so 


v(f (zo + h)) = st (f(zo + h) — f(Z0)/8”) 
= st H'f (zo))/nt/8") 
= st (H"/8")f (zo))/n! 
= st (h/8)"f (Zo)/n! 
= (zo + hy" (z0)/n! 
Write p(z9 +h) = re? and f™(zo)/n! = ke!” in polar coordinates, to get 
v(f (zo + re?)) = rel” ke’ 
so 
lv(f(zo + rel?))| = kr” 
and 
arc v(f (zo + h)) = nO +a 
This effect can be seen by starting with an infinitesimal 5-shape in the z-plane in the 
form of a sector of a circle centre zo, with two infinitesimal sides of radius 6, at (finite) 
angles 0 and 0 + ¢ to the real axis. The shape is then transformed in the w-plane as an 
infinitesimal sector that can be seen through an optical 6”-lens as a sector of a circle, 
centre Wo = f(zo), with the radius to f(zo + A) turned through an angle né + a, the angle 
between the two radii stretched to ng, and the lengths of each radius scaled from r to 
kr", see Figure 15.17. 


In the case of an infinitesimal 5-shape in the form of a circular path f(y(O) (ft € 
[0, 277]) then, as the circle is traced once round the critical point, the image path fy 
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Figure 15.17 Transformation of an infinitesimal shape at a critical point of order n. 


Figure 15.18 Optical image of a point moving round an infinitesimal circular path. 


traces round the corresponding infinitesimal circle in the w-plane n times. This relates 
to the theory of Riemann surfaces, where a point z in the z-plane moves once round a 
critical point as its image w = f(z) traces round the corresponding path in the w-plane 
n times. It is visualised in Figure 15.18 with part of the z-plane magnified by an optical 
5-microscope pointed at zo and the corresponding part of the w-plane magnified by an 
optical 6”-microscope pointed at wo. In the figure the image in the w-plane is represented 
by a horizontal plane in three dimensions and the path on the Riemann surface directly 
above has n levels. As y(t) moves in the z-plane from 0 to 27/n the corresponding 
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point on the Riemann surface moves once round the initial level and then moves up on 
successive levels as t moves from 2rz/n to 2(r + 1)z/n, returning to the initial level 
when t = 277. 

This phenomenon applies to every point z9 € S. Furthermore, each infinitesimal 6- 
circle lies within the monad M,,, and each monad contains just one complex number 
zo. If z € Zo then an infinitesimal 5-circle around z lies in the distinct monad M,. The 
behaviour of the function near any point z € C therefore depends only on the order of 
infinitesimality of the function f at the point z. 

This completes the picture of the behaviour of regular analytic functions from an 
infinitesimal viewpoint. An analytic function f : D — C has a natural extension f : 
*D — *C which generalises to a function from *D to the Riemann surface of f with 
multiple levels involving points in *C . This extension then has a unique analytic inverse 
from the Riemann surface back to *D. 


The Evolution of Meaning in Real and Complex Analysis 


At this point we are in a position to look back on the historical evolution of real and 
complex analysis to gain a broader view of the current state of the theory as part of an 
ongoing development that will continue to evolve in the future. The historical path taken 
depends on the prevailing ideas at the time and may later be seen in an entirely different 
light. We review relevant parts of Chapter 0 and then move on to the modern era. 


A Brief History 


Initially the calculus developed by Leibniz and Newton focused on the relationship 
between variables represented as curves in the plane. On the continent, the infinitesi- 
mal methods of Leibniz prevailed, while in England, Newton began by thinking of a 
variable x changing in time, which he called a ‘fluxion’, and its rate of change x’, which 
he called a ‘fluent’. He also formulated the Binomial Theorem in the form 


(x+ 0)? =x" 4+ nx" o+ inn —1)x"-292 +... 
leading to the expansion 


n n 
are —x =pxt=! agi 5n(n — 1)x'-2 +... 
This holds good not only for a whole number n, but also for fractional and negative 
powers, which expresses (x + 0)” as a power series in 0, and as o becomes small, leads 
to the term in o being small enough to neglect, giving the derivative of x’ as nx”~!. 

In the eighteenth century, work on the calculus by Euler and others focused on real and 
complex power series manipulated as symbols to give significant results whose logical 
status would be questioned by later generations. It was not until the turn of the nineteenth 
century that Wessel, Argand, Gauss, and others began interpreting the complex numbers 
as points in the plane. Derivatives were calculated using the same formula as the real 
case, while complex integrals were taken along contours in the plane from Zo to z;. As in 
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the real case, if f has an antiderivative F in its domain D then I iq f(2dz = F(z) —F (zo), 
so the Fundamental Theorem of Calculus generalises directly. But then the theory of real 
and complex functions began to diverge spectacularly. In real analysis, functions with 
unexpected properties were found: functions differentiable once but not twice; func- 
tions differentiable n times but not n + 1; and infinitely differentiable functions (such as 
f@® =e" ” for x # 0 and f(O) = 0) that do not equal their Taylor series expansion, 
even though it converges. There are even functions that are continuous everywhere and 
differentiable nowhere. Mathematicians realised that precise definitions and proofs were 
needed to cope with all these possibilities. 

Meanwhile, it was discovered that a complex function f differentiable in a domain 
D can be represented as a power series in a small disc around any point in D. At the 
same time, the arithmetisation of analysis led to the set-theoretic definition of the real 
numbers as a complete ordered field, which cannot include infinitesimals. 

This resulted in two parallel developments in the twentieth century with very dif- 
ferent philosophies. Pure mathematics excluded infinitesimals because they did not fit 
into the real number system, while applied mathematicians used them informally but 
productively to model dynamic situations. Robinson’s theory of non-standard analysis 
redeemed infinitesimals using mathematical logic. However, it was ignored by most 
applied mathematicians, who had no interest in that level of rigour, thought in terms of 
approximations, and were happy to discard any sufficiently small term if that proved 
fruitful, without asking further questions. It was actively resisted by many pure mathe- 
maticians, who had invested considerable effort in epsilon-delta analysis and preferred 
to remain with a familiar theory, or who preferred to avoid the subtle logical machinery 
of ultrafilters, with its reliance on the axiom of choice. 

In principle, any theorem in standard real analysis that can be proved using non- 
standard analysis has a standard proof, so non-standard analysis may seem to offer 
no advantages. However, proofs using non-standard analysis are often simpler than 
epsilon-delta, once the initial investment of mastering the set-up has been made. Some 
important theorems currently have only non-standard proofs, because an epsilon-delta 
proof gets bogged down in too many e’s. So non-standard analysis has become an 
accepted technical tool in mainstream research. However, it is still a very specialised 
area. 


Non-standard Analysis in Mathematics Education 


The intuitive appeal of infinitesimals, as opposed to the difficult machinery of epsilon- 
delta methods, offers some educational advantages for students meeting analysis for the 
first time. Some programmes have used this approach successfully, but they have not 
been widely adopted. 

The developments in this chapter reveal these apparently opposing views to be two 
sides of the same underlying coin. Applied mathematics focuses on variables that can 
“become small’ while pure mathematics focuses on fixed mathematical objects defined 
in epsilon-delta terms. Both can now be seen in a broader set-theoretical context in 
which real and complex numbers lie in larger structures that contain infinitesimals. The 
elements of such extension fields of R and C may be termed ‘quantities’. A quantity 
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x is defined to be infinite if |x| > r for all r € R. It is finite if it is not infinite, and 
infinitesimal if 1/x is infinite. Elements in R are called ‘real constants’ and those in C 
are ‘complex constants’. This gives a formal language to describe an extension of the 
number line or complex plane that resonates with the use of the term ‘infinitesimal’ in 
historical development. 

The structure theorems prove that every finite quantity x is either a constant or 
uniquely of the form x = a+ ¢, where a is either a real or complex constant and ¢ 
is a (real or complex) infinitesimal. The unique constant a is called the standard part of 
x. Optical microscopes let us see infinitesimal detail as a genuine picture on a line or in 
the plane. This returns us to an intuitive human way of visualising infinitesimal detail 
that resonates with earlier conceptions of the calculus that were rejected in mathematical 
analysis at the beginning of the twentieth century. 

In 1972 Stroyan [22] introduced the idea of infinitesimal microscopes and telescopes. 
These were used in Keisler’s undergraduate text Foundations of Infinitesimal Calculus 
based on Robinson’s non-standard analysis, Keisler [10]. In 1980 Tall [23] modified 
Stroyan’s definition to include optical lenses and applied the theory to the simpler system 
of superreal numbers. 

In his book Visual Complex Analysis, Needham [14] sees a complex analytic function 
w = f(z) transforming the z-plane into the w-plane. Figure 15.19 shows f(z) = 22 
transforming a shape in the z-plane to its image in the w-plane. 

The black T-shape in the z-plane is transformed visually into a T-shape in the w-plane 
that is scaled in size and rotated through an angle. The scaling and rotation become 
more precise as the shape becomes smaller. In particular, near a point z where f’(z) 4 0, 
a small shape is scaled approximately by a factor [f’(z)| and turned through the angle 
arg f’(z). Needham introduced the term ‘amplitwist’ to indicate that the transformation 
‘amplifies’ the scale by a factor |f’(z)| and ‘twists’ the shape through the angle arg f’(z). 
He used the term ‘infinitesimal’ in a technical way that ‘does not refer to some mystical, 
infinitely small quantity’. Instead (in Needham [14], pages 20—21) he suggested ‘two 
intuitive ways of speaking’, formulated as follows: 


...1f two quantities X and Y depend on a third quantity 6, then 
lims_s0 % =1 <=> ‘X =YY for infinitesimal 5’ 


<=> Xand ¥ are ultimately equal as 6 tends to zero. 


Figure 15.19 A visual transformation from the z-plane to the w-plane. 
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Figure 15.20 Polar coordinates r, 0, with increments dr, d@. 


This does not explicitly say what an ‘infinitesimal’ is. The approach in the current 
chapter defines an infinitesimal formally as an element in an ordered field extension K 
of the real numbers or in the corresponding complex extension field K[i]. 


Human Visual Senses 


Thinking of an infinitesimal as a variable that is ‘becoming small’ is a natural human 
way to visualise infinitesimal ideas. Our eyes and brains have been wired by evolution to 
detect moving objects. This way of thinking also has practical advantages over thinking 
of an infinitesimal only as a tiny physical quantity. For example, using polar coordinates, 
Figure 15.20 shows (r, 6) being given a further increment dr in the radius and dé in the 
angle. 

If r,0,dr,d0 are all finite quantities, we can draw this as a finite picture. But if r and 
@ are finite and dr, d6 are infinitesimal, we cannot draw the whole picture to the same 
scale. Using an optical 6-lens to magnify the infinitesimal shape sides dr by rdO to give 
a finite picture requires dr,d6, and 6 all to have the same order. If r # 0, then r has 
higher order than 4, so if a 5-lens is used magnify the shape dr to reveal a finite picture, 
the origin, distance r away, lies outside the field of view. On the other hand, if ris drawn 
to a finite scale, the optical picture of the infinitesimal shape reduces to a single point. 

This means that while we may imagine infinitesimals as arbitrarily small fixed ele- 
ments in an extension field of the real or complex numbers, when making a physical 
drawing, it is more natural to think of them as variable quantities that become very 
small. 

Recent research into how the human eye works shows that imagining points moving 
smoothly is a natural human facility. When we read text, our eye has a sharp focus which 
only reads a few consecutive letters (around four or five with standard size type) and our 
eye jumps along the line taking in information, attempting to build up the meaning of 
successive stretches of text. If you look at a paragraph on this page and focus on how 
you read it, you can sense these jumps (called saccades) as the eye moves along the text. 
(Try it now.) However, if you look at an object moving smoothly, there is an initial jump 
to lock on to the object in the sharp central vision, and the eye then follows the object in 
a smooth movement. This means that it is natural to imagine a point moving smoothly 
along a line. 
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Computer Graphics 


New technologies are evolving which let users dynamically control movement on a high- 
resolution visual display, smoothly zooming in on graphs or moving objects around. 
Software such as Mathematica and GeoGebra let a human operator code mathemati- 
cal ideas symbolically, and manipulate pictures dynamically to explore mathematical 
relationships. 

Placing the w-plane above the z-plane lets a three-dimensional view be drawn on a 
two-dimensional dynamic display, where z = x + iy is represented on the z-plane as 
(x,y, 0) and w = u + iv is represented on the w-plane as (u,v, 1). As z = x + iy moves 
around in the z-plane, the corresponding value w = f(z) = u + iv moves around in 
the w-plane. The transformation from z to w can be signified by an arrow linking z to 
f(z). As z varies, this gives a mapping diagram from the z-plane to the w-plane as shown 
earlier in the lower part of Figure 15.4. 

Figure 15.21 shows a display created in GeoGebra (see Flashman [6] to explore the 
dynamic software on the internet). It is in three framed areas. The central area shows the 
mapping diagram for the function f(z) = z+ pz +q from the z-plane to the w-plane 
for p = —1.9,q = 3.3. On the left is an area showing the z-plane and a circle radius 
6 = 1.0 with m = 24 points equally spaced around it. The area on the right shows the 
graph of the real function y = x3 + px + q. On the screen are boxes to input the formula 
for the function f and the variables p and q, as well as sliders with movable points to 
vary p, g, 6, and m. The central area can also be selected to change the viewpoint or the 
scale. 


Figure 15.21 Using Geogebra to explore the cubic function f(z) = + pz+q. You can interact 
with this worksheet at www.cambridge.org/gb/S&T-geogebra 
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New possibilities arise that cannot be adequately described using a static picture in a 
textbook. The design provides two boxes to tick: one (MD) hides or shows the Mapping 
Diagram, which consists of the arrows from the points around the circle radius 6 in the 
z-plane to the w-plane; the other shows or hides the roots of the equation f(z) = 0. 

The three roots marked rj, 72, r3 are shown in the panel on the left, where r3 is on the 
real axis, and r; ,r2 are complex conjugates. Moving g up or down moves the real graph 
in the right frame dynamically up or down by the same amount. The graph as pictured 
has one real root near —2, and two complex conjugates near 1 +1. As q decreases, the 
real cubic curve moves down by the same amount until the function has three real roots. 
As this happens, the roots rj, rz in the left frame move towards each other until they 
coalesce as a repeated real root, then separate into two distinct real roots, so that the 
cubic has three real roots. 

The user can manipulate the picture dynamically to focus on aspects of interest. For 
example, the left panel has an icon ® which, when clicked, causes the chosen parameter 
(here p) to vary dynamically over its given range (here —5 to 5) and, as it does so, 
it displays the changing situation of the roots of the real cubic as the equation shifts 
between having one real and two complex conjugate roots to having three real roots. 

The GeoGebra environment allows the user to create and explore dynamic figures in 
complex analysis, Flashman [5]. 


Summary 


Interpretations of complex analysis continue to evolve into the future. New approaches 
in modern times are part of a more comprehensive overall picture in which intu- 
ition may be translated into rigour and conversely, proving structure theorems returns 
formal rigour to more sophisticated forms of symbolic manipulation and visual 
intuition. 

This broader vision reveals the fundamental distinction between real and complex 
analysis. Real analysis considers functions that exhibit many different properties and 
require highly sophisticated construction of the hyperreal numbers to cope with the logi- 
cal introduction of infinitesimals. Meanwhile, complex differentiable functions are given 
locally by power series around any point in a connected open domain and these can be 
manipulated in the much simpler extension field C, generated by a single infinitesimal e. 
This simpler extension encompasses the more powerful theory of analytic continuation 
and Riemann surfaces. From this higher vantage point we can see, at a single glance, the 
distinction between the technically complicated theory of real analysis and the subtle 
sophistication of complex analysis. 


Exercises 


1. Figure 15.22 shows the path y : [—2z,2] — C given by y(t) = ¢ + it and the 
analytic function w = f(z) = cos z. Calculate fy, y’(t), and (fy)’(t) as functions of 
t. Interpreting ¢ as time, consider the points y(#) and (fy )(t) moving along the paths 
imy,imfy. By comparing the motion in the w-plane in the intervals —z < t < 0 
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im( /7) 


z-plane w-plane 


Figure 15.22 Geometry for Exercise 4. 


and 0 < t < 7, or otherwise, explain what happens in the w-plane as f¢ passes 
through 0. 

2. Let y : [-1,1] > C be given by y(f) = # + if’. Use a graph plotter to draw 
the parametric graph of (¢3, 7) and magnify the graph at various points. Conjecture 
what the highly magnified portions of the graph will look like for t = 0 and t ¥ 0. 
Calculate y(t) numerically for t = —0.001,t = 0,t = 0.001 and draw the graph 
for —0.001 < t < 0.001 on a standard piece of paper. What happens if the graph is 
scaled from t = —10~® to +10~° ona standard piece of paper? 

3. For f(z) = z, plot the paths y)() = 1+it (-1 <t< 1), ~w@® = —-1+4+it(-1 <t< 
1), y3(4) = it (—1 < t < 1) in the z-plane and their transformed paths fy, , fy2, fy; 
in the w-plane where w = f(z). Imagine ¢t increasing smoothly in time from t = —1 
to t = 1 and trace the movement of the corresponding point along each graph with 
your finger. One has a different behaviour from the other two. Which one? Explain 
why, by considering the critical point of f. 

4. Determine the Taylor expansion f(zo + h) of the following functions about a point 
zo and deduce the order of the variable part f(zo + h) —f(zo) at that point: 

(i) sing —zatz =0 
(ii) cos z at z = O and at z =i 
(iii) 2° at zo = O and at z £0 
(iv) The principal value Logz at z = 1 

5. Let K be a proper ordered field extension of the real numbers R. For x € K, write 
down the definition for x to be finite, positive infinite, negative infinite, infinite, 
positive infinitesimal, negative infinitesimal, infinitesimal. In each of the following 
cases, decide whether the given statement is true or false for all x,y € K. If it is 
true, give a proof; if it is false, provide a counterexample (for instance, using the 
field K = R(v) of rational functions in a variable v). 

(i) x negative infinite implies 1 /x is negative infinitesimal 
(ii) x infinite implies 1/x is infinitesimal 
(iii) x infinitesimal implies 1/ x is infinite 
(iv) x finite, y infinitesimal implies xy is finite 
(v) x finite and non-zero, y infinitesimal implies xy is infinitesimal 
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10. 


11. 


(vi) x infinitesimal, y infinitesimal implies x/y is infinitesimal 
(vii) x finite, y infinite implies xy is infinite 


. For x € K[i] where i2 = —1, write down the definition for x to be finite, infinite, 


infinitesimal. In each of the following cases, decide whether the given statement is 
true or false for all x,y € Ki]. If it is true, give a proof; if it is false, provide a 
counterexample. 
(i) x infinite implies 1/x is infinitesimal 
(ii) ) x infinitesimal implies 1/x is infinite 
(ili) x finite, y infinitesimal implies xy is finite 
(iv) x finite, y infinitesimal implies xy is infinitesimal 
(v) x infinitesimal, y infinitesimal implies x/y is infinitesimal 
(vi) x finite, y infinite implies xy is infinite. 


. In the field K = RQ) of rational functions in a variable v, place the following 


elements in linear order and in order of infinitesimality: 


v, 1 —v, 1/0 —v),-1/, v/W - 1) 


. Prove Proposition 15.11 that if F is the subset of finite elements in a super ordered 


field K and I is the subset of elements that are zero or infinitesimal, then 


a,b € Fandc,d € Iimpliesa+ b,a—b,abé€ F,c+d,c—d,cdél 


. Prove Proposition 15.12 that if x, y € F, where F is the subset of finite elements in 


a super ordered field, then the standard part map st satisfies 


st (x + y) = st(x) + st(y), 
st (x — y) = st(x) — st(y), 
st (xy) = st (x) st (y) 
st (x/y) = st (x)/st (y) (for st (y) 4 0). 


Calculate the coefficients Cm+n, Cmtn+1, Cm+n+2, Cm+n+3 Which arise when division 


E+) E)-(E 


in the superreal field R, and the supercomplex field C,, is determined by solving 
the equation 


(Cmpne +--+ (cpe? ++ (dpe ++ - + (dpe? +--+) = (aQne™ +: > +(ape” +--+) 


for successive coefficients Cin, Cmtnti>+ ++ 


Show that there are some positive infinitesimals 6 € R, that fail to have a square 
root in R,. Nevertheless, show that, for any z = x + iy € C, where x, y € Rg, the 
modulus |z| = . x2 + y* can be calculated as an element in R,. (Hint: each of x, y 
is either zero, or of the form ae”(1 + 5), where 6 is infinitesimal.) 
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12. For an infinitesimal 6, calculate which of the following are finite and which 


13. 


14. 


15. 


16. 


are infinite (using power series for trigonometric or exponential functions where 
necessary): 
@) 36 + 1/6 
(ii) sin(izr + 3) 
(iii) 57 sin(1/5) 
(iv) sin(izr + 4) 
(v) elt té 
(vi) + fork eC 
In those cases where the element is finite, calculate its standard part. 


For f : D — C (where D is a domain in C) given by the following formulas, 
calculate the standard part of (f(z + h) — f(z))/h in a super complex field, where h 
is an infinitesimal, and hence find the derivative f’(z) for z € D. 

@ 227 
(ii) 1/(z — 2) for z #2 
(iii) 1 +2)" forne N 
(iv) sinz (in a super complex field including power series expansions) 
(v) 22 sinz 
Write down the definition for hyperreal numbers *R, hypercomplex numbers *C 
and the transfer principle for hyperreal and hypercomplex numbers. 

(i) Use the first order statement Vx € R dn € N: n > x to deduce that there exist 

infinite hypernatural numbers N € *N, where N ¢ N. 
(ii) By considering the property 


Vma,neN:m>n>m-—-neN 


use the transfer principle to show that if m is an infinite hypernatural number 
and nis a natural number, then m — n is an infinite hypernatural number. 

(aii) Let E = {(2n EN: n EN}, O= {2n—1 €N: n € N} be the set of even 
and odd natural numbers. Use the transfer principle to show that every infinite 
hypernatural number JN is either even (in *E) or odd (in *O) and that N is even 
if and only if N + 1 is odd. 


Using the hypercomplex numbers, by considering the standard part of zy for infinite 
N, determine whether the sequence (z,) converges. If it does, calculate the limit, if 
not, explain why. 

(i) (n? + sinn)/(n3 — 2n7) 


Gi) PER tee forke Ck 1 
2n? +3n—17 
(ii) 1 7 Gy fees (5)""! for n odd, et for neven 
(n* + 3n)(n + 1) 
|. w®+sinn 1 
(iv) —+—x—> for n odd, 2 + — forn even 
n° — 2n n 


Visit the Geogebra website www.geogebra.org/ and use it to investigate some of the 
curves in C and R? discussed in this book. 
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Homology Version of Cauchy’s 
Theorem 


In Chapter 8 we proved the classical version of Cauchy’s Theorem, using step paths to 
simplify the geometry and topology involved. In Chapter 9 we developed the concept of 
homotopy, and reformulated Cauchy’s Theorem in that framework. Homotopy is one of 
the basic ideas in algebraic topology. A related topological concept, homology, emerged 
a little earlier from work on the classification of surfaces. This notion and its extensions 
are of great importance in algebraic topology. When applied to domains in C, it provides 
fresh insight into Cauchy’s Theorem and its consequences. 

In complex analysis we view homology as a topological property of adomain D C C, 
but we define it in terms of properties of paths in D. There are then deep connections 
between the homology of D and the behaviour of complex integration along paths in D. 
In this respect homology is like homotopy, which is also a topological property defined 
in terms of paths. The two concepts are related, but different. For homotopy, two paths 
in a domain are considered to be equivalent if one can be continuously deformed into the 
other. In the context of homology, two paths in a domain are considered to be equivalent 
if they differ by a path that forms the boundary OR of a rectangle R whose image lies in 
D, as described in Section 9.3 — and, by extension, if they differ by a finite set of such 
boundary paths. 

In this chapter, we develop a homology version of Cauchy’s Theorem, and examine a 
few of its consequences, including a homology version of Cauchy’s Residue Theorem. 
The material here is a natural continuation from Chapter 9. We have postponed it until 
now to avoid delaying the more standard material in Chapters 10-14. The main ideas 
are simple, elegant, and geometrically intuitive, although the formal setting takes a little 
getting used to. Also, proofs have a habit of getting tangled up in combinatorial argu- 
ments about step paths. Here, algebra often comes to our rescue. The topology is fairly 
subtle, and results that seem obvious sometimes require a little care if we want a rigor- 
ous proof. We work throughout with step paths that lie in a domain D C C. Recall that a 
domain, in this context, is open and connected; moreover, being connected is equivalent 
to being path-connected. Cauchy’s Theorem concerns the integral of a complex function 
on D over a closed step path in D. 


REMARK 16.1. For the figures in the chapter, we often do not draw paths explicitly as 
step paths, because lots of little steps everywhere just make the figures more complicated 


without adding any understanding. We draw the steps only when they matter. 
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Outline of Chapter 


We sketch the main ideas of this chapter now. Formal definitions and proofs come later. 
Throughout this chapter, we abbreviate ‘step path’ to ‘path’, except when we want to 
emphasise that we are working with step paths. Our terminology and approach to homol- 
ogy are tailored to the material in this text, and adapted to the simpler geometry of step 
paths. Topologists generally set up homology in a more general, more abstract, and more 
elegant manner, see for example Hatcher [8] or Hocking and Young [9]. 

Homology provides a natural context for several features of complex integrals that we 
have previously noticed: 


e Integrals are additive: the integral of a given function along a sum of paths is the sum 
of its integrals along the separate paths. 

e Moreover, the integral of the function along the reverse of a path is minus the integral 

along the path, allowing subtraction as well as addition. 

Cauchy’s theorem for a boundary, Theorem 9.6, states that if R is a rectangle and 

@ : R — Discontinuous, and f : D — C is differentiable, then the integral of f 

along the boundary of @ vanishes. 


On the other hand, the integral of f along an arbitrary closed path need not vanish. Its 
value depends on how the path winds around points not in D. 


All of these facts are closely related, suggesting that they can all be put together in a 
natural context. The question is: what? 

Our definition of a sum of paths in Section 2.8 involves joining them end to end, 
and we have to impose special conditions for that to be possible. This corresponds to 
the classical view that a function should be integrated ‘along’ a path. However, we can 
extend integration to any finite set of paths, even if they do not join up to create a single 
path, just by adding the separate integrals together. A path y can be repeated any number 
of times to give a path ny for n € N \ {0}. For negative n we can define ny to be —n 
times the reverse path —y. Finally, the empty set of paths plays the role of 0. So we can 
define an arbitrary integer combination 


MY, ++ FIM Yn 


of paths y;, for m; € Z. 

At first glance, these combinations seem to form a group (indeed, an abelian group) 
under addition, but there is a technical problem: if y is a path, y + (—y) ought to be 
the zero element of this group, which is empty. Actually, as described above, it is the set 
{y, —y}, which is not empty. 

On the other hand, the integral of f along y + (—y) is always zero, so this discrepancy 
has no effect on integrals. All of which suggests that we must somehow redefine y + 
(—y) to be zero. Mathematicians have a standard trick to resolve this kind of difficulty: 
introduce an equivalence relation in which y + (—y) is equivalent to 0, and work with 
the equivalence classes. The equivalence relation we need is homology, and the clue for 
how to define it is Cauchy’s Theorem for a boundary. Essentially, we want any boundary 
path to be equivalent to zero. Everything else follows from that. 
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Since we are trying to construct a group, a natural way to achieve all this is to build 
in the group property right at the start. Then Cauchy’s Theorem for a boundary tells us 
how to define the equivalence relation of homology using group theory, where it takes a 
very simple, though more abstract, form. To accomplish this, we define integration over 
‘chains’. A chain is a formal integer combination 


P= my +--+ Mpyn 


where the y; are step paths (not necessarily closed) and the mj; €¢ Z. We explain what this 
means, how to formalise it, and how to visualise it, in Section 16.1. When all m; = 0 the 
chain is empty, and this possibility is allowed in the definition. 

Be warned: addition here is not quite the same as addition of paths as defined in 
Section 2.4, and —y is not quite the same as the reverse path to y. As we have just 
seen, the classical operations + and — run into technical problems, and we have to get 
round those. However, the classical notions and the abstract group operations are closely 
related. The precise relationship is given in Lemma 16.15: we can identify them ‘modulo 
homology’. 

The group of chains has two important subgroups. The first is the subgroup of 
‘cycles’, which correspond to formal sums of closed step paths, which we call ‘loops’. 
Inside the subgroup of cycles we define a further subgroup of ‘boundaries’. A boundary 
is now a formal integer combination 


B=mB, +---+m,B, 


where each 8; = OR; is the boundary of a rectangle in D, and the m; € Z. Every 
boundary of a rectangle is a loop, so in particular boundaries are cycles. 

By ‘rectangle in D’ we mean a continuous map @ : R — D such that the image 
@(R) © D, just as in Cauchy’s Theorem for a boundary. Chains, in contrast, are defined 
by the image @(0R), and in that case the map is not required to be defined on the interior 
of R, let alone to have its image a subset of D. Indeed, the distinction between these two 
cases is central to homology. 

To complete the set-up, two cycles Qj, Q2 are said to be ‘homologous’ if their dif- 
ference Q) — Q2 is a boundary. Another term for being a boundary is ‘homologous to 
Zero’. 

The machinery of chains, cycles, boundaries, and homology is beautifully adapted 
to natural properties of complex integrals along paths. Let f : D — C be a continuous 
function. Integration over paths can be generalised to integration over chains, hence also 
over cycles or boundaries, by defining 


[tam ptt fF 


n 


Boundaries come into play because we proved in Theorem 9.6 that the integral round 
the boundary of a rectangle whose image lies inside a domain is zero: 


by 
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So the integral of f over a cycle Q is unchanged if we add or subtract a series of bound- 
aries of rectangles — that is, if we add a boundary to Q. In other words, for any fixed f, 
the integral depends only on the homology class of the cycle. 

We also need to define winding numbers for chains, which we do by defining 


w(T, zo) = m1 w(y1, Zo) +--+ + mnw(yn, Z0) 


We already know that the winding number for a path can be specified by an integral as 
in (7.6). This relationship also holds for chains. 

Once these concepts have been set up, we come to the main aim of the chapter, which 
is to prove: 


THEOREM 16.2 (Homology Version of Cauchy’s Theorem). Let f : D > C bea 
continuous function, and let Q be a cycle in D. Then the following are equivalent: 


(i) Qis homologous to zero (that is, it is a boundary) in D. 
(ii) All integrals over Q vanish: 


/ f =0 for all differentiable f. 
Q 


(ii) All winding numbers of Q around points outside D vanish: 
w(Q,z0) = 0 forall zo ¢ D. 


The proof of this theorem occupies most of this chapter. When the proof is complete, 
we derive some useful consequences. 


Group-theoretic Interpretation 


Homology can be interpreted group-theoretically, as Emmy Noether noticed when 
attending lectures on topology by Heinz Hopf in 1926—7. Leopold Vietoris and Walther 
Mayer independently discovered the same interpretation at much the same time. Chains 
C, being formal integer combinations of specific objects (here paths), form an abelian 
group under formal addition. Cycles Z form a subgroup of C, and boundaries form a 
subgroup B of Z. Because the group of chains is abelian, these are normal subgroups. 
The equivalence relation of homology on Z determines a quotient group 


H = Z/B = cycles/boundaries 


Specifically, the equivalence classes for homology are precisely the cosets of 6 in Z. 

All four groups C, Z,B, and‘H depend on D. The group 7 is the first homology group 
of D, usually written H,(D). (In the topology of higher-dimensional spaces in R” and 
C” there are also ‘higher’ homology groups Hy (D) for k = 0,1, 2,3,..., but only H;(D) 
concerns us here.) The elements of 7 can be thought of as the ‘essentially different’ 
topological types of loops in D, up to homology. 

With hindsight, our previous results related to Cauchy’s Theorem offer strong hints 
at the existence of this algebraic structure. As already noted, integrals vanish on bound- 
aries. Paths can be added to give new paths, and integrals are additive. So are winding 
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numbers. Our aim is to develop a formal algebraic context that unifies these hints. 
Algebraic topology originated from these and related considerations. 

To make the discussion as self-contained as possible, our approach will be classical (a 
polite way to say ‘old-fashioned’). The modern approach to homology is more general 
and more abstract. 


Chains 


Let D C C be a domain. Recall that a (step) path in D is a map 
y: [a,b] > D 


with the step property (sum of finitely many line segments parallel to the real or 
imaginary axis). To keep the terminology simple, we define: 


DEFINITION 16.3. A loop in D is a closed step path y : [a,b] — D, so y(a) = 
y(b). 
Let 


P = {all paths in D} 


We turn P into an abelian group using an algebraic trick (which can turn any set into a 
subset of an abelian group): 


DEFINITION 16.4. A chain TY in Dis a formal integer combination 
T=my +--+ + myn (16.1) 


where the y; € P are distinct paths in D and the mj; € Z. 
The set of all chains is denoted by C. 


A more respectable, but less convenient, definition views the m; as the values of a 
function m : P — Z, which maps y; to m;. Thus we define 


Z(D) = {m:P > Z: m(y) = 0 for all except a finite number of y € P} 
We can pass between functions m and formal sums (16.1) as follows: 


(i) Given (16.1), define m by: 


_jm if yy 
mo ={ 0 if y #y; foranyj 


(ii) Given m, define 


P={y €L:m(y) £0} 
m=my;) fory ¢T 


We can add two such functions: 


(m, +m)(y) =m (vy) + m(y) 


and we then have: 
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D 


=) 


Figure 16.1 Visualising a chain. This is T = 3y) — 4y2 + 273. It so happens that yj is a loop. 


PROPOSITION 16.5. The set of chains C(D) is an abelian group under the operation +. 


Proof. The zero element of C(D) is the function m for which m(y) = 0 for all y € L. 
The additive inverse of a function m is the function —m defined by 


(—m)(y) = —(m(y)) 


It is easy to see that + is associative and commutative. 


Technically, C is the ‘free abelian group’ generated by the set of paths P. 

At first sight, it is not obvious how to visualise a formal sum of paths, espe- 
cially one with integer coefficients. One way is to draw the images of the different 
paths y; in the sum, and label each by the corresponding ‘multiplicity’ m;, as in 
Figure 16.1. 

The image of a cycle ! = }° mjy; is defined to be the union of the images of the y; 
for those j with mj A 0. 

We extend the definition of the integral and the winding number from loops to chains: 


DEFINITION 16.6. Let fF = )° mjy; be a chain, where the yj are paths and zo € C. 
Then define 


G@) 
[t=Dm fs 
T j Yj 


(ai) 


w,z0) = >) mjw(y;,20) 
J 


The definition trivially implies: 


PROPOSITION 16.7. The integral is additive: 


ae ae 
[,f=-]Y 
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Using Theorem 7.8 is it easy to prove: 
PROPOSITION 16.8. The winding number is additive: 
w(0, zo) = 0 


wT, +12, 2) = w1, 20) + wT, Z0) 
w(-T, Zo) a —w(T, zo) 


We can now generalise Section 7.5: 


THEOREM 16.9. Let! = )° mjy; be a chain, and let zo be any point in C that does not 
lie on the image of T. Then 
1 dz 
T 


271i Jp Z— Z0 


Proof. 


wT’, Zo) = s. mW(Yj,Z0) 
J 


1 dz 
= omg 
J 


yj ~~ %0 


1 / dz 
211 dX yj 


, £— ZO 
1 
~~ Ori rz-—Zo 


We sometimes need to make an explicit distinction between a path y : [a,b] > DC 
C and its image. To do so we write the image as 


y ={y(t): te [a,b]} 


We include the hat only when it might be confusing to omit it. 


16.2 Cycles 


Next we need a special type of chain, which we call a cycle. A cycle is a ‘closed chain’. 
That is, the group of cycles is generated by chains whose images decompose into a series 
of closed loops. That is, if the usual sum y, +--+ -+ yy is a loop, then the corresponding 
formal sum inC is a cycle. Then all formal sums of cycles are again cycles. By definition 
the cycles form a subgroup of C, which we denote by Z. 

Next we define a subgroup B of Z, whose elements we call ‘boundaries’, which 
consists of cycles that can be ignored when calculating integrals (and therefore winding 
numbers). That is, cycles Q such that 


/ f =0 for all differentiable f : D— C 
Q 
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(As aconvention, we use I’, y when discussing chains and (2, @ when discussing cycles.) 
If we let 


then Theorem 16.9 implies that all boundary cycles automatically also satisfy 
w(2,Z0) =9 (Zo ¢D) 


Which cycles should we use? By Theorem 9.4 the group B should include all bound- 
aries OR of rectangles R, as in Section 9.3. Several special loops of this type are 
particularly useful. 


Sums and Formal Sums of Paths 


We have already defined y + 6 and —y for paths: respectively, adjoin the two paths and 
combine them into a single one, and reverse the path. These operations are not those 
of C. However, we want them to be the same ‘up to homology’, see Section 16.4. That 
is, they should be equal modulo 8. We can then think of these geometric operations as 
being the corresponding operations in C (or the subgroup Z, which is more important 
here) whenever homologous chains can be considered equivalent. 

Recall from Section 6.8 that if y : [a,b] — Dand 6: [b,c] — D (not necessarily 
closed paths) we defined —y : [a,b] ~ Dandy +4: [a,c] ~ D by 


—yQ =y@+a-t) 


and 
y(t) if te€[a,bd] 


(y+ 6)@) = 6(t) if te [b,c] 


(The original definition was slightly more general, allowing the interval for 6 to be [c, d] 
and then shifting it to make c = b. That option is not needed here.) See Figure 16.2. 

Although we are using the same symbols + and —, these operations are not (quite!) 
those of the group C, and we need to sort out the precise relationship. To avoid confusion, 
we temporarily write the operations inC as @ and ©, and use + and — for the operations 
on paths that we defined in Chapter 6. 

First: they really are different. The path-sum y + 6, considered as an element of C, 
corresponds to the function m : P — Z such that 


my +6)= 1 


Figure 16.2 Previous definitions of —y and y + 6. Here both paths are loops, so they can also be 
thought of as cycles. 
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mo) =0 (o #y +8) 
In contrast, y @ 6 € C corresponds to the function m’ : P — Z such that 


m(y)=1 
m' (6) = 1 
m(o)=0 (6 #y,6) 


These are different functions because, as elements of P, wehavey @6 Ay, y@sb $4, 
so 


m(y @5)=0 my +6)=1 


Similarly, Gy (cycle) is not the same as —y (path). 


Boundaries 


The role of homology is, in effect, to force these technically different forms of addition 
and reversal to be the same. It exploits a standard trick encountered throughout mathe- 
matics: define an equivalence relation for which objects that differ in inessential ways 
are equivalent. Then their equivalence classes are the same. For instance, in arithmetic 
modulo n, a difference between integers that is divisible by n is inessential. We would 
like to set n = 0, but that is technically false. So we define 7, s to be congruent modulo 
nif r — s is divisible by n. Congruence is an equivalence relation, and the set of equiva- 
lence classes is denoted Z,. Now, although r and s are different in Z, they represent the 
same congruence class. 

In this case Z,, is the quotient group of the additive group of integers Z by the sub- 
group of nZ of multiples of n; that is, Z, = Z/(nZ). (The multiplicative structure can 
also be included, because nZ is an ideal of the ring Z.) We can play the same game with 
cycles in place of Z, and whatever differences we wish to ignore playing the role of nZ. 
In this case, the differences that we want to forget about are boundaries, in the following 
sense: 


DEFINITION 16.10. The group & of boundaries in D is the subgroup of Z generated 
by all boundaries OR of rectangles in D. 


To make the argument clear, we continue to use @,© in place of +,— in C to 
distinguish the group operations from the standard operations on paths. To get round 
one further technicality we go the whole hog and throw in all reparametrisations 
of y that preserve orientation. Consider an orientation-preserving homeomorphism 
p : [a,b] — [a,b]. That is, p is continuous and has a continuous inverse p~!, and 
p(a) = a, p(b) = b. Equivalently, o is monotonic strictly increasing on [a, b] and satis- 
fies p(a) = a, p(b) = b. Now yp(t) = y(e(t)) defines a loop yp. By Proposition 9.15, 
this change has no effect on integrals, hence on winding numbers. 

If we allow p to reverse orientation, the integral changes sign, which is why we 
require orientation to be preserved. An alternative is to allow orientation to be reversed 
as well. Then —y is just a special reparametrisation, and any orientation-reversing 
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Figure 16.3 Maps of rectangles used in the proof of Proposition 16.11. Top: Reversal. Middle: 
Sum. Bottom: Reparametrisation. 


reparametrisation is the map y +» —y composed with an orientation-preserving 
reparametrisation. 


PROPOSITION 16.11. Let y, 6 be paths in D and let yp be an orientation-preserving 
reparametrisation of y. Then 


y @(-y)€B (16.2) 
(vy 85)-(v+d)€B (16.3) 
yeypeB (16.4) 


Proof. In each case we specify a map ¢@ : R > D with a suitable boundary. Figure 16.3 
illustrates the underlying geometry. Assume that y : [a,b] — Dand6: [b,c] > D; 
also that p : [a,b] — [a,b] is the reparametrisation. 

For (16.2), let (t,5) € [a,b] x [0,1] = R and set 


b(t, 5) = vy) 


(so @ is independent of s). Then @(R) = y([a, b]) C D, and dg = y + (—y). 
For (16.3), let (t,5) € [a,c] x [0,1] = R and set 


y(t) if te [a,b] 
6(t) if te [b,c] 


Now @(R) = y([a, b])Ud([c, d]) C D, and 0¢ can be decomposed into (y @46)6 (y +4). 
For (16.4), let (t,5) € [a,b] x [0,1] = R and set 


d(t,s) = y(1 — s)t + sp(t)) 


o(t,s) = 
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Then $(R) =  ([a, b]) © D, and a6 = y — yp. 


PROPOSITION 16.12. IfQ € B then 
if f =0 for all differentiable f : D— C 
Q 
and 


w(2,Z0) =0 (Zo ¢D) 


Proof. It is enough to check that these conditions hold for boundaries of rectangles 
in D, because both the integral and winding number are additive by Propositions 16.7 
and 16.8. 


Homology 


The rather pedantic distinction between @ and +, and the other items discussed above, 
does not cause any great problems, because — as we now prove — it disappears when we 
pass to homology classes. But first, we must define homology: 


DEFINITION 16.13. Two cycles Q 1, Q2 in D are homologous, written Qy ~ Qo, if 
2,022. €B 


The homology class of Q is its ~-equivalence class: 


[Q] = {Q’:Q' ~Q} 


It is easy to see that [Q] is the coset of B in Z that contains Q. We therefore define: 
DEFINITION 16.14. The first homology group of D is the quotient group 
H\(D) =H = Z(D)/B) 
It is abelian since Z(D) is abelian. | 
We denote the identity element of 1 by 0. 
LEMMA 16.15. For all loops y,6 and orientation-preserving reparametrisations p, 


y@eor~yts 
oy = 
yep~y 
Proof. In each case the left- and right-hand sides differ by an element of B, by 
Proposition 16.11. Therefore they define the same element of 2/B = H. 


This lemma shows that the distinction between @ and +, and the other similar 
distinctions, disappear when we work modulo B. We therefore use the ordinary sym- 
bols +, — for cycles as well as loops, and stop using @,©. They have served their 


purpose. 
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LEMMA 16.16. Ify ~ y’/ and 6 ~ 6’ theny +6 ~ y/ +56". 


Proof. Both y — y’ and 6 — 6’ lie in B. Now add. 


REMARK 16.17. Homology ‘really’ works with the image / of a path rather than the 
path y : [a,b] — D. More precisely, what matters is the image and its orientation. 
Orientation is relevant because y and —y have the same image, but the integral along 
one is minus that along the other. We want y — y € B but not y + y ¢€ B to make 
integrals vanish on cycles in B. O 


Example 16.18. Let D = C \ {0}. Define y : [0,1] ~ Dby 


y(t) — ett 
and 6: [0,1] — Dby 
5(t) — JVerrit 
as in Figure 16.4 (left). 
We claim that y ~ 6. 
To prove this, introduce the path o = [1,2] and its reverse —o = [2,1], each 


parametrised by ¢ € [0, 1]. 
Now y — 6 is homologous to y +o — 6 —o because modulo B we have 


yto-—s-—oy—8+o0-o0=y+6+4+0=y+6 


Then we claim that y + o — 6 —o = 0 modulo B. This follows because y +0 — 6 —o 
is the boundary of a rectangle. Specifically, let R = [0,1] x [0, 1] and define a map 


¢@: [0,1] x [0,1] ~ D 
by 
b(r,s) = (1 +s)?" 


Then it is straightforward to check that dg = y +o — 6 —o, see Figure 16.4 (right). 
But now, modulo Bb, we have y — 6 =0, soy ~ 6. 


REMARK 16.18. It is possible to define the relation of homology using winding 
numbers. (Obtaining the homology group is less straightforward.) In this approach a 
boundary is defined to be any cycle Q such that w(Q, zo) = 0 when zo ¢ D. The proof of 
Cauchy’s Theorem still requires most of the same ideas, and we feel that this definition 
of homology loses contact with the original geometric ideas that inspired it. Eventually 
we will show that our definition is equivalent to the one based on winding numbers, but 
we have to prove Theorem 16.2 to do this. 


The following result is crucial: 
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Figure 16.4 Left: The paths y, 5,0, —o. Right: Mapping a rectangle so that its boundary is 
ytoa-—sé-a. 


THEOREM 16.19. Let f : D> C be differentiable on a domain D. Let Qy, Q2 be cycles 
in D. If Qy ~ Q2 then 
fr-fs 
Q Q 


Proof. Since Q1 ~ Q2 is equivalent to Q1 — Q2 € B, and the integral is additive, the 
statement is equivalent to 


ree Brnen f ¢=0 
Q 


Again using additivity, this follows if the integral over every generator of B is zero. But 
B is generated by boundaries of rectangles whose images lie in D, and the integral over 
the boundary of a rectangle is zero by Cauchy’s Theorem for a Boundary, Theorem 9.6. 


The winding number is a homology invariant: 


COROLLARY 16.20. If Qy ~ Q2 then 


w(Q1, 20) = W(Q2,z0) forallzo ¢D 


Proof of Cauchy’s Theorem, Homology Version 


We now give the proof of Theorem 16.2, the homology version of Cauchy’s Theorem. 
Cauchy’s Theorem for a Boundary, Theorem 9.6, gives the implication (i) => (ii). That 
(ii) = (1) follows from Theorem 16.9. So it remains only to prove that (ili) > (i). 
Recall that condition (i) states that Q is homologous to zero in D, or equivalently, Q € B. 
Condition (iil) states: 


w(Q,z0) = 0 forall z ¢D 
which we call the non-winding condition. 


Most of the proof that (iii) = (4) is an exercise in making careful distinctions between 
paths and their images, and using the relatively simple geometry of step paths to control 
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the topology. This does involve some rather pedantic distinctions, needed only for this 
proof. 
Let 


Q= Vo nyo; (16.5) 


for loops a; in D, and nj € Z. We may assume that the «; are distinct and that all nj 4 0. 
(if not, Q = 0 € B anyway.) 

We assume the non-winding condition and deduce that Q is homologous to zero. 

The idea of the proof is to simplify Q by repeatedly changing it to some cycle 2’ that 
is homologous to Q, and to use induction on some suitable measure of the complexity of 
the cycle. The definition of 2’ is chosen so that the complexity decreases at each stage. 
A simple result along these lines is Corollary 16.20, which implies that if Q ~ Q/ and 
zo ¢ D, then 


w(Q, 29) = w(Q", zo) 


Thus, if Q satisfies (iii) then so does Q’. Trivially, if Q’ satisfies (i) then so does Q. 
Therefore, if (iii) implies (i) for Q’, the implication also holds for Q. In other words: 


When proving (iii) implies (i) for Q we can replace it by any homologous 2’. 


Grid of Rectangles 


To construct suitable 2’ we proceed as follows. 

As in the proof of Lemma 8.6, we form a grid of rectangles. Extend each edge of Q 
(horizontal or vertical) to a line in C. These lines divide C into finitely many rectangles, 
and the lines cross at grid points, see Figure 16.5. Some of the ‘rectangles’ are infinite; 
these play no role as we shortly see. 

Let P be the set of all grid points that lie on Q, along with any comers or ‘ends’ where 
an edge of terminates without continuing. (The corresponding path ‘doubles back’ at 
such a point. In the generality employed here, this can happen.) 


Figure 16.5 Rectangles and grid points. Here Q is a sum of three loops. 
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We use P to split the w; into sums of paths, such that the image of each path is an edge 
of some rectangle. Let the parameter values of w; be t € [a,b]. Since a; is a loop we can 
change parameters if necessary so that wj(a) € P. (Start at a corner or some other grid 
point, and note that a change of parameter is a homology.) Then the set of ¢ for which 
«;(t) € P (where the path goes through a point in P) is a finite set, and can be arranged 
in order as 


a=to<t) <::'<t_1<t=b 
Let wo; be the restriction of a; to [t,-1,4% +1] fork = 1,...,n. Then 
1 


We call each or a segment of Q. Its image or is an edge of some rectangle, and by 


definition or G ren Moreover, ws has an orientation in the direction of increasing f¢. 
Thus we can think of wf as an oriented edge of a rectangle. 


Now Q is (homologous to) a sum of segments, over all OF, and Q is the union of their 
images. 


Example 16.22. Consider Figure 16.6. Here Q is a single loop, but this overlaps itself 
several times. Its path is indicated by the grey polygon, and the loop visits P in the order 


ABEDGHIFCBEHGDEBA 


Therefore 


Q = [A,B] + [B, E] + [E,D] + ---+ [B, A] 


Several segments occur more than once, sometimes in reverse orientation. For example 
AB and BA occur, and BE occurs twice, along with its reverse EB. In such cases, the 


image of Q overlaps itself. 


We have now changed Q by a series of homologies into a sum of integer multiples of 
segments. (To avoid proliferation of dashes, we continue to call the changed cycle Q.) 


B 
> i ° ~<4 | ied 
v 
A v 
A 
. + ~<?— ee D 
I v 
A v 
A 
o  ~« or 40 


H G 


Figure 16.6 A loop with several overlaps. 
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Collect together all segments with the same image. Note that if o is a segment then 
o+(-0)~o-o0~0 


since it is a boundary; indeed a generator of type (ii). So we can cancel these two seg- 
ments without changing the homology class of Q. Since o + (—o) is a loop, the rest of 
Q remains a cycle. That is: 


Without loss of generality, Q contains no pair of opposite segments. 
Therefore we can write 


Q= > mo; 
j 


where the oj are distinct segments, oj and —o; do not both occur, and the m; are non- 
zero integers. (There is one exception: all mj; may be zero. But then Q € B and we are 
finished.) 

For induction purposes we require: 


DEFINITION 16.23. The complexity of Q is 
c(Q2) = number of segments + number of relevant rectangles 


Here, as in the proof of Lemma 8.6, a rectangle R, is relevant if w(Q,z,-) = 0 for z; at 
the centre of R;. All infinite rectangles are irrelevant, and some finite ones may also be 


irrelevant. 


Observe that cancelling opposite segments decreases the complexity, so this ‘without 
loss of generality’ step in the proof does not affect induction based on the complexity. 


Proof of Theorem 16.2. 


We prove that (iii) implies (i) by induction on the complexity c(Q). 

The induction starts with c(Q) = 0. Then Q is the empty cycle. This corresponds to 
the zero element 0 € C, and since B is a subgroup of C, we have 0 € B. So the empty 
cycle is homologous to zero. This starts the induction. 

For the induction step, take Q with c(Q) > 0. Define A to be the top right point of P. 
That is, 


dG) imA > imB forallB e P 
(ii) re A > reB for all B € P such thatimB = imA 


Since P is finite its top right point exists and is unique. 
We consider the geometry of 92 ‘near A’. There are three possibilities: 


(i) A lies on a vertical edge of Q but not on any horizontal edge. 
(ii) A lies on a horizontal edge of Q but not on any vertical edge. 
(iii) A lies on a horizontal edge of Q and on a vertical edge. 
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=) 
e> 


Figure 16.7 Three possible cases for the geometry near A. Dotted lines indicate possible 
connections to the rest of the cycle. 
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Figure 16.8 Rerouting segments to get rid of R. 


Figure 16.7 shows these three possibilities, where A, B, D € P. 

In cases (i) and (ii), there must be a segment BA in that orientation, and another with 
the reverse orientation AB, because Q is a cycle. These two segments cancel, contrary 
to our previous assumption, so case (i) cannot occur. Similarly, case (ii) cannot occur. 
So only (iii) is possible. 

Orient AB from A to B and DA from D to A. Then all segments in Q with image AB 
occur m times, where 0 4 m € Z. If m > 0 they have orientation AB; if m < 0 they 
have orientation BA. Since Q is a cycle, the segments of Q with image DA must also 
occur m times, where 0 4 m € Z. If m > 0 they have orientation DA, of m < 0 they 
have orientation AD. (Any loop that arrives at A must leave A.) 

Let C be the fourth corner of the rectangle R = ABCD. Then A, B, C, D are grid 
points and A, B, D € P. The geometry on R is shown in Figure 16.8 (left), with m 
indicating |m| copies of a segment in the direction of the arrow for m > 0, and in the 
opposite direction for m < 0. 


Rerouting Segments 


We now claim that by using a homology we can re-route the segments of Q that lie in 
DAB to corresponding segments in DCB, as in Figure 16.8 (right). To set this up, we 
pause from the proof of Theorem 16.2 to prove: 


LEMMA 16.24. With the above notation, if z lies in the interior of R then 
w(Q,z) =m (16.6) 


Proof. Draw a vertical ray A from z upwards, to infinity. This cuts AB at one (interior) 
point X, as in Figure 16.9. 
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Figure 16.9 Rectangle R and ray 4. Shaded area indicates half-plane containing Q>. 


Let Q, be Q with all segments in DA and AB removed (this is not a cycle). Let Q, be 
Q plus the rerouted segments in DC and CB (all of multiplicity m). Then Q2 is a cycle. 
We claim that 


Qo ~ Q 


Segments of Q lying in AB cut A m times (with AB oriented to the left). The cycle 
Q2 does not intersect A because A is the top right point of Q, so Q2 lies in the half-plane 
below AB. 

By continuous choice of argument, as in Section 7.4, 


w(Q22,z) = 0 (16.7) 
Let 
5=mdRE Z 
Clearly 
Q = Q2 +6 


because segments of the right-hand side on DC or CB cancel, and the other segments of 
6 restore the segments on DA and AB. 

We now face a minor technical problem: we do not (yet) know that DC and CB lie in 
the domain D. We therefore need to prove this, as follows. 

If z is in the interior of R, then 


w(6,z) =m 


by continuous choice of argument. AB crosses A only at X, where it crosses m times in 
the anticlockwise direction. 
Therefore if z is in the interior of R, 


w(Q, Z) = w(Q2 + 6,z) = W(Qz,z) + WS, 2) =O+m=m 40 


using (16.7) and additivity of w. 
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Resumption of Proof of Theorem 16.2. 


By (iii), which we are assuming, we have z € D. Therefore the interior of R lies in D. 
If BC € Q then BC C D. Otherwise, continuity of the winding number at points not 
in Q implies that w(Q, z) = 0 if z lies on the interior of BC. The same goes for CD. 
Finally, suppose that z = C. If C lies in Q then z € D. If not, we again appeal to 
continuity of w to deduce that w(Q,z) = m # 0. As above, z € D. So the whole of R, 
interior plus boundary, lies in D. But now 


56=mdR~ 0 
since OR is a boundary, so lies in B. We finally conclude that 
Q ~A2-68~Q 


The complexity c(Q2) < c(<2). The number of segments does not change, but we have 
removed the rectangle R. This is relevant since w(Q, zr) = m 4 0, where Zp is the centre 
of R. On the other hand, it is easy to see that no other rectangle can be irrelevant for Q2 
but relevant for Q. The reason is that if zz is the centre of rectangle Rx, the continuous 
argument definition of w implies that 


1 if k=l 


WBoad=| 4 if k4l 


Additivity then proves that for all rectangles Rx except R we have 


w(22, ZK) = (Q, zx) 


Thus we have removed one relevant rectangle but not introduced any new ones, and 
c(Q2) < c(Q) as claimed. 
Inductively, Q2 ~ 0. But Q ~ Q2 so Q = 0. This completes the induction step, 


proving Theorem 16.2. 


Cauchy’s Residue Theorem, Homology Version 


We can now use Theorem 16.2 to prove a homology version of Cauchy’s Residue 
Theorem, Theorem 12.3. 
First: 


LEMMA 16.25. Let Q), Q2 be cycles in D such that 
w(Q1, z) = w(Q2, z) for all z ¢ D 
Then 
Q1 ~ Qe 
Proof. For all z ¢ D, 
w(Qy — Qo, z) = w(Qy, z) — w(Q2,z) = 0 


By Theorem 16.2, Q) — Q2 ~ 0. So Q) ~ Q2. 
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Suppose z1,...,2, are distinct points in C, and define the k-times punctured plane to 
be 


D=C \ {z,..., zk} 
It is easy to see that Dis a domain. (Recall: this means open and connected.) 
THEOREM 16.26. If D is the k-times punctured plane, then 
H\(D)=Z'=Z@---@®Z (ktimes) 


Proof. We do more: we find independent generators for H,(D); that is, elements such 
that any element is a unique integer combination of them. 
Let Q be a cycle in D. Then forj = 1,...,k, no z; lies on Q. 
Since Q is a closed set, its complement is open, so there exists ¢ > 0 such that for all 
P= Iencask 
NG) NQ=B 
Let Rj be a square (with horizontal and vertical sides) of side ¢/2 centred at z;. Then 


Rj SC Ne(z) So RF Q =. (We use a square to make 0R; a step path.) 
Define cycles 


pj = OR; 
oriented anticlockwise. Then clearly 
1 if j=l 
w( js 21) = ae Awe (16.8) 


using a suitable cut plane and a continuous choice of argument. Let 
m; = w(X, z;) 
We claim that 
Q~ Pi mp; 
This follows because 
w(Q, z1) = mi = w oy mp}, 21) 


by additivity and (16.8). Now apply Lemma 16.25. 
We also claim that the m; are unique: no other Z-linear combination of the p; is 
homologous to Q. Equivalently, we must show that if 


p= > mp ~0 (16.9) 


then all mj = 0. 
Suppose (16.9) holds. Then w(p, z) = O for all z ¢ D by Theorem 16.2. That is, 
w(p,z;) = 0 for all / = 1,...,k. But by Proposition 16.8, 


w(p, 7) =w o> mip}, 2) =m 
Thus m; = 0 for all? = 1,...,k. 
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Clearly every p is a cycle, so the map 


Q = (m),..., Mp) 
induces an isomorphism 
H,(D) > Z* 
We say that the homology class of Q is the corresponding (mj,...,m,) € ZK. 


THEOREM 16.27 (Cauchy’s Residue Theorem, Homology Version). Let D be ak-times 
punctured plane and suppose that f : D — C is differentiable. Let Q by a cycle in D of 
homology class (m,,...,mk) € ZE. Then 


k 
/ f= 2i Y mjres (f,z)) 
Q ai 


Proof. 2 ~ pa mjp;, SO 


fy-dfy 


Since f is differentiable, we can apply Theorem 12.3, Cauchy’s Residue Theorem for a 
loop, to obtain 


f = 2mires (f, z) 
Pj 
We could go on to state a more general version of Cauchy’s Residue Theorem for 
other domains, such as C with a discrete infinite set of points removed, or a disc 
with finitely many points or discs removed. We could also consider analogues for Rie- 
mann surfaces. But it seems sensible to stop here, having opened up the possibilities of 
homology and used it to prove two powerful theorems in complex analysis. 


Exercises 


1. Let D = {ze C: 1 < |z| < 2}. Prove that H1(D) = Z. 

2. Let D be an open disc with k distinct points removed. What is H,(D)? Prove your 
answer correct. 

3. Let Dbe C with k disjoint closed discs removed. What is H,(D)? Prove your answer 
correct. 

4. Suppose that K = {z, : k € N} is an infinite set of distinct points, and that K is 
discrete, that is, for each k there exists € > O such that N,(z,) NK = {z;}. Prove a 
version of Cauchy’s Residue Theorem for the domain D = C \ K. 

5. Construct a closed step path in C \ {0} that is homologous to zero, whose 
complement has four distinct connected components. 
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6. The Oth homology group Ho(D) can be defined in a manner analogous to that for 


10. 


11. 


H(D), as follows. A O-cycle is a finite formal sum of pairs of (not necessarily 
distinct) points in D. A 0-boundary is a finite formal sum of pairs of points in D that 
can be joined by a step path in D. 

Show that 0-cycles form a group Z and 0-boundaries from a group Bp. Define 
Ho(D) = Zo/Bo. 

Show that the pair (z1,z2) is a 0-boundary if and only z; and z lie in the same 
connected component of D. Deduce that Ho(D) is generated by all cosets of B of 
the form (zx, zx) + B, where one point zx € C is chosen for each connected com- 
ponent K. (That is, Ho(D) is isomorphic to the free abelian group on the connected 
components.) In particular, if D has a finite number k of connected components, 
prove that Ho(D) = Z 


. Prove the Dog Walking Theorem: If 01, 02 are loops, both parametrised by ¢ € [a, b], 


and there is a point zo such that 
lo, (t) — 02(t)| < |o1() — zo| for all t € [a,b] 
then 
W(01, 2%) = w(2, Zo) 


and 0; ~ o2 in C \ {zo}. 

Explain the name of this theorem. (Hint: o; is the man, o> the dog, and zg is a 
tree.) 

Use it to give another proof of Rouché’s Theorem 12.14. 


. Jordan Contour Theorem for Loops. Recall that a loop is a closed step path. Say 


that a loop o : [a, b] — C is simple if it does not cross itself; that is, o(t1) = o(t2) 
with 4, At € [a,b] only when the ¢; equal a or b. 

Use the trick in the proof of Theorem 16.2, of rerouting the segments of a cycle 
(here one loop) round the rectangle R adjacent to the top right point of 6 to prove 
that: 

(i) C \ & has exactly two connected components: one bounded, the other 
unbounded. 
(ii) Let I(o) be the bounded component in (i) and O(c) be the unbounded 
component. Prove that w(o, z) = +1 if z € (oc) and w(o, z) = Oif z € O(o). 
(iti) 0I(o) = OO(o) =G. 


. If o1, 02 are simple loops and 6; C J(o2), prove that [(o,) C I(o2) and O(o,) D 


O(02). 


Suppose that a simple loop o lies in a domain D. Prove that there exists ¢ > 0 such 
that |z — o(ft)| < e for ze C and some t € [a, b] implies that z € D. (Hint: use the 
Paving Lemma.) 


Using the result of Exercises 9 and 10, prove that if o is a simple loop in D there 
exists € > 0 such that there are simple loops o/,o9 such that: 
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12. 


13. 


14. 


(i) Every point of ¢° U 6! lies within distance ¢ of some point of 6. 

(ii) o lies in O(c) and o! lies in I(c). 
(iii) The regions I(a°) \ (6 UI(a)) and I(o) \ (6! U I(o")) are connected. 
If D is a domain (hence connected) and o is a simple loop in D, prove that both 
DN O(c) and D NI(c)/ are connected. 


Define the edges of a loop as follows. Define a corner to be a point at which the 
image of the loop switches between horizontal and vertical. Assume that o(a) is 
a corner, and let Q be the set of t € [a,b] such that o(f) is a corner. Arrange the 
elements of Q in order: 


a=to<t)<:::-<t,=b 


Then an edge is the image of o restricted to some interval [4—-1, 4] forj = 1,...,n. 
Define the edges of a cycle to be those of its component loops. 
Say that a cycle o is in general position if no two edges are collinear (that is, 
both lie in the same horizontal or vertical line in C). 
Prove that every cycle in a domain D is homologous to a cycle in general position. 
(Hint: inductively displace successive edges parallel to themselves through a 
small distance, so that the displaced edge differs from the original edge by the 
boundary of a rectangle lying inside D. Fix up the geometry at corners, where as 
stated there may be a gap or a small overlap between the displaced edges.) 


Show that any non-zero cycle is homologous to a sum of disjoint simple loops o;,, 
each occurring in the sum with multiplicity +1. (Hint: use Exercise 13 to put it in 


Figure 16.10 Left: Partial ordering of loops (here shown as ellipses, not step paths, for clarity) by 
containment of interiors. Right: Cancelling adjacent oppositely oriented loops by making a cut. 
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general position, then reroute edges near crossing points to eliminate the crossings, 
using small rectangles inside D.) 


15. Using induction on the number of simple loops in such a decomposition, obtain 
a different proof of Theorem 16.2, which is modelled along the classical lines of 
simplifying loops by ‘making cuts’ that cancel each other out. See for instance 
Figure 11.2. 

Warning: This exercise is hard, but the following sketch may help. Think of it as 
a mini-research project. 

First, show that disjoint simple loops are partially ordered by the relation 0, < 
02 <=> [(o1) € [(o2). This ordering can be complicated, see Figure 16.10. 

Show that if (411) holds then there must be two simple loops 01 < o2 with opposite 
orientations, such that there is no simple loop 03 with o1 < 03 < o2. Using some 
of the exercises above, show that there is a step path t joining a point on oj toa 
point on o7 that lies in D and intersects no other oj appearing in the decomposition. 
Use t to write 


op to2.=p+t 


where p is a simple loop in D and t’ is a simple loop in D that is homologous to 
zero, see Figure 16.9. This reduces the number of simple loops by 1, and induction 
reduces to the case of one simple loop o . Show that J(c) lies in D and complete the 
proof using property (iii). 
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The Road Goes Ever On... 


The end of this book is fast upon us, but there is as yet no discernible end to com- 
plex analysis itself. It remains a vigorous and growing part of mainstream mathematics. 
We sample this rich area by giving brief descriptions of five areas of current research: 
the Riemann Hypothesis, modular functions, several complex variables, complex 
manifolds, and complex dynamics. 


The Riemann Hypothesis 


One of the more unexpected applications of complex analysis occurred in the second 
half of the nineteenth century, leading to major advances in number theory. At first sight 
these two areas have little connection: complex analysis is about continuous quantities 
and number theory is discrete. Even more surprisingly, the connection that emerged did 
so in the context of statistical properties of prime numbers. The starting point was a new 
kind of complex function, based not on power series, but on Dirichlet series of the form 


oo An 
WO= DoF 


where the a, and s are complex numbers and the domain is restricted to make the series 
converge. For example, if the a, are bounded, we require res > 1. More strongly, if the 
partial sums a; + --- + ad, are bounded, we require only res > 0. Classical concepts 
such as the gamma function (and, later, more sophisticated new ideas) were then used 
to extend the definition of ¢ to the entire complex plane, except for isolated poles. 

Prime numbers are of vital importance in mathematics, yet they notoriously lack any 
clear pattern. Given a list of all primes up to some particular number, there seems to 
be no simple way to predict the next one. However, hints of regularity emerge when we 
think about entire ranges of primes. For instance: how many primes are there up to some 
specified limit x? It is difficult to answer this question exactly, but good approximations 
are another matter. In 1797-8 Adrien-Marie Legendre counted how many primes occur 
up to various limits, using tables of primes provided by Jurij Vega and Anton Felkel. 
Denoting the number of primes less than x by 7 (x), as we now do, he observed empiri- 
cally that 2 (x) seems to be close to x/(log x — 1.083 66). In a letter of 1849, Gauss wrote 
that when he was about 15 he noticed that 2 (x) is approximately x/ log x for large x. 
In 1838 Dirichlet wrote to Gauss to tell him he had found a similar but more accurate 
approximation to z (x), the logarithmic integral 
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* dt 


Li(x) = aie 
iQ) 90 logt 


For example, when x = 10°, 


(x) = 50 847 534 
Li(x) = 50849 234.9 
x/logx = 48 254942.4 


The approximations here are asymptotic: that is, the ratio of the approximate formula to 
the true value z (x) was conjectured to tend to 1 as x — ov. The error in an asymptotic 
formula can be quite big; it just has to be significantly smaller than the exact quantity. 

All of these observations were purely empirical, and a rigorous proof that these for- 
mulas are asymptotic to a(x) became a major open problem in number theory, called 
the Prime Number Theorem (at a time when ‘theorem’ was also used to refer to a con- 
jecture). The eventual proof of the Prime Number Theorem relied on deep applications 
of complex analysis. 

The relation between these apparently disconnected areas of mathematics goes back 
to Euler, who realised in 1737 that uniqueness of prime factorisation has an implication 
for real analysis, namely the formula 


<i 1 
~s=II- 


—s 
n=1 P P 


where p runs through the primes. To prove this, observe that summing a geometric series 
gives 
1 1 1 1 


l-p? 1 / ps "pes / 


Expand the product on the right-hand side, and observe that each term 1/n* appears 
exactly once as | divided by the sth power of a product of prime powers. 

Euler considered only positive integer s, but in 1848 and 1850 the Russian mathemati- 
cian Pafnuty Chebyshev attempted to prove the Prime Number Theorem by applying 
real analysis to Euler’s series, assuming s to be real and greater than 1, so that the series 
converges. He managed to prove that for large enough x the ratio of 7(x) to x/ log x lies 
between two constants: one slightly bigger than 1, the other slightly smaller. 

Riemann realised that real analysis was too limited to complete the proof, but complex 
analysis was more powerful and might do the trick. He noticed that Euler’s series con- 
verges for any complex s with res > 1. Its sum, called the zeta function ¢ (s), is complex 
analytic. He wrote up his ideas in 1859. They included an explicit formula expressing 
qt (x) exactly in terms of the zeta function. To state it, Riemann defined a related function 


M(x) = (x) + smal?) + acs!) + pres fees 


which counts the prime powers up to x. From this it is easy to deduce 2(x). Then he 
proved that 
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ve dt 
= 4 = (+P | 
T(x) = Li(x) » Li’) 4 [ = leat 


where the sum is over all zeros of the zeta function, excluding negative even integers. 
(We explain where those come from in a moment — their role is not apparent in the 
formula for the infinite series.) The zeros and poles of a complex analytic function 
are important because they characterise the function completely if we also know their 
orders. It turns out that the zeros of the zeta function include all negative even integers, 
along with infinitely many others. 

In 1896 Jacques Hadamard and Charles Jean de la Vallée-Poussin independently 
proved the Prime Number Theorem using Riemann’s zeta function, a triumph for what 
became known as analytic number theory — the application of complex analysis to prop- 
erties of whole numbers. This approach has now blossomed into a huge area of number 
theory. 

Riemann took the crucial step of using analytic continuation to extend the definition 
of ¢(s) so that s can be any complex number except 1. This step depends on the gamma 
function, introduced in Section 6.11, which is a generalisation of the factorial function 
n! of a whole number n € N. 

To extend the domain of definition of the zeta function, we first note that the series 
for ¢(s) converges only when rez > 1. However, it is easy to verify that 


1 2 Ce 1 1 
( =) 1D» Qs * 38 


The series on the right converges for rez > 0, so the formula lets us extend the definition 
of the zeta function to such z, and it remains an analytic function of z. Then, starting 
from a formula of Adolf Hurwitz, Riemann deduced the functional equation for the zeta 


function: if 0 < res < 1, then 


¢(s) = 2a! sin (> ) rd —s)¢d —s) 


where I" denotes the gamma function, Section 6.11. This formula can then be used to 
define ¢(s) for all complex z 4 1, by assuming it holds whenever 0 < res, giving an 
analytic continuation of the zeta function to C \ {1}. The same formula shows that 


¢(-2n) =0 n=1,2,3,... 


because the factor sin (5) vanishes at these numbers. The negative even integers are 
the trivial zeros of the zeta function. 

In his paper, Riemann [16] observed that there also exist non-trivial zeros of the zeta 
function — those that are not negative even integers. In particular he found zeros at 


514.1351 8 =5421.0221 5+425.011i 


but did not publish these results. His calculations suggested that all non-trivial zeros 
seem to have real part a This statement is now known as the Riemann Hypothesis. (His 
actual statement was about a closely related function and is equivalent to this.) He wrote: 
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One would, however, wish for a strict proof of this; I have, though, after some fleeting futile 
attempts, provisionally put aside the search for such, as it appears unnecessary for the next 
objective of my investigation. 


As time passed, it became clear that the Riemann Hypothesis is of central impor- 
tance in mathematics. It implies strong bounds on the error in the Prime Number 
Theorem — the order of magnitude of the difference between the approximate formula 
and z(x). For example, in 1901 Niels Helge von Koch showed that if the Riemann 
Hypothesis is correct then the error |Li(x) — 2 (x)| grows no faster than a constant times 
,/xlog x, and in 1976 Lowell Schoenfeld gave the explicit bound 


1 42 
\Li(x) — m(x)| < a log x (x > 2657) 


The Riemann Hypothesis also tells us about the size of gaps between consecutive 
primes. Better still, there are far-reaching generalisations with major implications 
throughout number theory, and it has applications to testing numbers to see whether 
they are prime, which is important for encrypted Internet communications. There is 
a lot of circumstantial evidence in favour of the Riemann Hypothesis, and computer 
calculations have verified it for the first ten trillion zeros. 

Despite a huge amount of effort, however, the Riemann Hypothesis has neither been 
proved nor disproved. Hilbert included it in his famous list of unsolved problems in 
1900. The Clay Mathematics Institute is currently offering a prize of one million dollars 
for solutions to six major unsolved problems in mathematics, and the Riemann Hypoth- 
esis is one of them. A seventh, the Poincaré Conjecture, was solved in 2002-3 by Grigori 
Perelman, who declined the prize. 


Modular Functions 


Extending the idea of periodic functions (such as exp) leads to doubly periodic func- 
tions, which satisfy f(z) = f(z + p) = f(z +g) for two complex numbers p, g that are 
linearly independent over the reals (not real multiples of each other). These are also 
known as elliptic functions because some of them can be used to give a formula for the 
arc-length of an ellipse. 

Further generalisation leads to modular functions, which transform in a nice way 
under a discrete group of Mobius maps. Here ‘discrete’ means that each transforma- 
tion is separate from the others rather than being part of some continuous family. For 
example, Z? is a discrete subgroup of R?. 

These functions preoccupied many mathematicians at the end of the nineteenth cen- 
tury. These united in one package group theory, differential equations, algebraic function 
theory, topology, and complex analysis. This is still an important field of research. For 
example, modular functions are heavily involved in Andrew Wiles’s celebrated proof of 
Fermat’s Last Theorem. 

The starting point is the modular group, which consists of all Mobius maps 


az+b 


Mz) = 


cz+d 
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for which a,b,c,d € Z and ad — bc 4 0. This group is also called the special linear 
group over Z, denoted SL2(Z). If H denotes the upper half-plane {z € C : imz > 0}, 
then (subject to some technical conditions omitted here) a modular function is a function 
f :H— C that is analytic on H and satisfies the condition 


f(u) = (cz +d) f(2) Vz €H 


for any 2 € SLo(Z), where the weight k is an odd positive integer. 

Generalisations of these functions, known as modular forms, have many applications 
to number theory. Wiles deduced Fermat’s Last Theorem from his proof of a long- 
conjectured relationship between modular forms and certain number-theoretic equations 
called elliptic curves because of the connection with elliptic functions. An ellipse is not 
an elliptic curve. 


Several Complex Variables 


Functions of several complex variables may be studied. They turn out to be far trickier 
than we might expect, with startling new phenomena. 
A function of several complex variables is a map 


f:D-C 


where the domain D is an open subset of C” for some n € N,n > 2. We require f to 
be analytic, in the sense that for each a = (a),...,a,) € U there is a convergent power 
series expansion 


PCy 0 0) = D> Bley adn 1 — 21) +++ Gp — any 


There is an equivalent condition analogous to the Cauchy—Riemann Equations, which 
can be proved using a form of Cauchy’s Integral Formula. A more common term in this 
context is that such an f is holomorphic on U. 

Some of the basic theorems and concepts of complex analysis generalise to several 
variables, but many others do not. Analytic continuation has a natural definition, and 
is again unique but possibly multivalued. One of the most influential new discoveries 
is the Hartogs’ phenomenon, named after Friedrich Hartogs. In one-variable complex 
analysis, there are analytic functions with a natural boundary — a closed curve beyond 
which they cannot be analytically continued. Equation (14.5) gives an example. Hartogs’ 
extension theorem of 1906 states, roughly speaking, that no such thing occurs for two or 
more variables. The domain on which a function is holomorphic is always unbounded. 

Hartogs proved more. In one-variable complex analysis, for any domain D there exists 
an analytic function f : D — C for which the boundary dD is a natural boundary. 
Hartogs proved that if 


A? = {((a1, 22) € C? : [zi] < 1, lel < 1} 
Ae = {(Z1, 22) € ge zi] < eorl —€ < |z2]} 


for 0 < € < 1, then any function f holomorphic on H, has a holomorphic extension F 
to A. Indeed, F can be defined using Cauchy’s Integral Formula. 
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The theory was developed by Hartogs and Kiyoshi Oka in the 1930s. Hartogs proved 
that every isolated singularity is removable when n > 2, another crucial difference from 
the one-variable case. The subject really began to take off in 1945 with work of Henri 
Cartan, Hans Grauert, and Reinhold Remmert. It soon evolved into the more general 
context of complex manifolds, which we consider next. 


Complex Manifolds 


The Riemann surface alone has opened up broad vistas, by capturing the essential struc- 
ture of a complex function in a single geometric object. All kinds of information, such as 
the presence of branch points and singularities, can be read off from it — and the ‘rigid- 
ity’ of analytic functions means that they are essentially determined by the position 
and nature of their singularities. Many questions that seem baffling without Riemann 
surfaces become transparent when this extra geometry is invoked. 

The notion of a complex manifold arises when we generalise Riemann surfaces to 
several complex variables. The basic definition is modelled on real manifolds: these are 
multidimensional analogues of surfaces, obtained by patching together open subsets of 
IR” whose coordinates are smoothly related when the patches overlap. In the complex 
case we use open subsets of C” whose coordinates are analytically related when the 
patches overlap. 

In differentiable topology a smooth real manifold may have several distinct differen- 
tiable structures, but in all dimensions except 4 the number of such structures is finite. 
However, a fixed manifold of even dimension can possess infinitely many different com- 
plex structures. These can be classified by associating each complex structure with a 
point in a so-called moduli space which is itself a complex algebraic variety — roughly, 
the set of zeros of a system of polynomials in several complex variables. 

The whole area is deeply related to topology and to algebraic geometry, and we will 
not attempt to summarise it. Because all of these areas are highly abstract, until recently 
much of this work was viewed by most non-mathematicians (among those aware of 
its existence) as mere generalisation for its own sake — pretty, intellectually clever, but 
far too wild and abstract to have sensible applications outside pure mathematics. Such 
judgements are usually superficial and premature when they refer to the mathematical 
mainstream: most ideas that are important in pure mathematics eventually acquire sig- 
nificant uses. And so it has proved here. Complex manifolds and automorphic functions 
are now central to the physics of Quantum Field Theory and the study of gauge fields in 
particle physics, for instance. 


Complex Dynamics 


Let f : Z — Z be a map from a set Z to itself. Then we can apply f repeatedly to any 
Zo € X to get the sequence 


zo, f(zo), fF (Zo), FF FZo))),- -- 


This process is classically known as iterating f. It can be defined inductively by 
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The modern term is discrete dynamical system. If we think of the subscript n as ‘time’, 
ticking in whole number steps, the sequence (z,) specifies how an initial state x9 changes 
over time. Usually the set Z is a smooth manifold — a multidimensional analogue of a 
smooth surface. A central aim of dynamical systems theory is to understand sequences 
of this kind, especially their long-term behaviour. 

Complex dynamics studies discrete dynamical systems when Z = C (or some sub- 
set of C) and f is differentiable. The first serious work on this topic was carried out 
by Pierre Fatou in 1917 and Gaston Julia in 1918. Their methods were analytic. With 
the advance of computer graphics it became possible to draw accurate pictures, which 
revealed surprisingly intricate shapes of great beauty. The associated mathematics is also 
very beautiful, from a logical point of view. However, many interesting and important 
problems remain open at this stage. 

For this brief discussion, we restrict attention to polynomial f. Analogous concepts 
can be studied for general differentiable f, but some definitions are more complicated. 
See Devaney [3] for the details. A key concept is the Julia set of f. This can be char- 
acterised in several equivalent ways, by proving appropriate theorems. Perhaps the 
simplest is to distinguish two types of behaviour for the sequence (z,,). Depending on Zo, 
either the set of all z,» remains bounded for all n € N, or it diverges to infinity. The Julia 
set J(f) is the boundary of the set of zo for which it diverges to infinity. For polynomial 
f this is also the boundary of the set of zo for which it remains bounded. The Julia set is 
invariant under f, that is, f—'(J(f)) = ff) = J(f). 

For example, if f(z) = 2? then J(f) is the unit circle. On J(f), the map f sends e to 
e719 doubling the angle. If f(z) = z* — 2 it can be proved that J(f) is the line segment 
[—2, 2], and the dynamics of f on this interval is more complicated. 

The geometry of Julia sets is fascinating, even for very simple maps f. It is best 
understood (and even then not completely) for quadratic maps 


fOQ=fD=2+e cec 


Sometimes J(f-) is quite simple. For instance, if c < : then J(f-) is a simple closed 
curve. For other values of c it is a highly intricate fractal, as in Figure 17.1. For some c 
the Julia set is a disconnected dust cloud; for others it is connected. 


Figure 17.1. Two examples of Julia sets. 
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Figure 17.2 Left: The Mandelbrot set. [From Wikimedia Commons under the GNU Free 
Documentation License.] Right: Close-up of the boundary of part of the Mandelbrot set. 


To bring order to the Julia sets, we consider how their topology depends on the con- 
stant c € C. The Mandelbrot set M © C is defined to be the set of c for which J(fc) is 
connected. Figure 17.2 (left) shows the Mandelbrot set: it is the black cactus-like region. 
The boundary of the Mandelbrot set is infinitely complicated — a small part is shown in 
Figure 17.2 (right). Numerous images can be found on the Internet, including movies 
that zoom into the set at ever greater magnification. 

Near any point c the Mandelbrot set looks like the Julia set J(f.). It has been proved 
that the Mandelbrot set is connected. It is conjectured to be locally connected (look up 
the definition!) but this conjecture remains open as we write. Analogues for other types 
of differentiable complex function have been studied, and many problems remain open. 


Epilogue 


In a sense, the wheel has come full circle. The contour of history has wound round 
the singularity of mathematical discovery, and closed. More precisely, it has returned to 
its starting point but climbed one level up the Riemann surface of scientific advances. 
In its early days, complex analysis was (almost) a branch of physics; the connections 
with potential theory and fluid mechanics were widely exploited. Towards the end of the 
nineteenth century Felix Klein offered a ‘proof’ of a theorem along the following lines: 
think of the Riemann surface as being made of thin metal, and an electric current flowing 
through it... . Today this would not be considered a logically convincing argument, but 
the physical intuition behind it led to some important mathematical discoveries. Now we 
are witnessing the converse process, with mathematical intuition leading to important 
ideas in physics. There is a two-way trade between mathematics and its applications. 
And whatever the attractions of beauty for its own sake, this trade is vital for the health 
of both mathematics and science. 
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Praise for the first edition: 


‘The authors strike a pleasant 
balance between informality 
and rigour and the reader is 
always kept well informed of 
the general picture ... ithas 

a large number of worked 
examples and exercises and 

| recommend it as a text for 
students.’ 

A. F. Beardon, Bulletin of the 
London Mathematical Society 


‘lan Stewart and David Tall 

are excellent mathematical 
communicators, and 

their expertise in both 
mathematics and mathematics 
education shines out of this 
book. It provides not only 
clear explanations of the 
technicalities but also a wealth 
of visual representations from 
which students can develop 
intuition for the meanings of 
the key definitions and for the 
deep connections between the 
results. Complex analysis is a 
beautiful subject, and there is 
no-one better to learn it from.’ 
Lara Alcock, 

Loughborough University 
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This new edition of a classic textbook develops complex analysis from 
the established theory of real analysis by emphasising the differences 
that arise as a result of the richer geometry of the complex plane. 

Key features of the authors’ approach are to use simple topological 
ideas to translate visual intuition into rigorous proof, and, in this 
edition, to address the conceptual conflicts between pure and applied 
approaches head-on. 

Beyond the material of the clarified and corrected original edition, 
there are three new chapters: Chapter 15 on infinitesimals in real and 
complex analysis; Chapter 16 on homology versions of Cauchy’s 
Theorem and Cauchy’s Residue Theorem, linking back to geometric 
intuition; and Chapter 17 outlines some more advanced directions in 
which complex analysis has developed, and continues to evolve into 
the future. 

With numerous worked examples and exercises, clear and direct 
proofs, and a view to the future of the subject, this is an invaluable 
companion for any modern complex analysis course. 
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